TITLE

PLANT CELLULOSE SYNTHASES 



This application claims the benefit of U.S. Provisional Application No. 60/092,844,
filed July 14, 1998.

FIELD OF THE INVENTION 
This invention is in the field of plant molecular biology. More specifically, this 
invention pertains to nucleic acid fragments encoding cellulose biosynthetic enzymes in 
plants and seeds. 

BACKGROUND OF THE INVENTION 
Cellulose is a major component of plant fiber, e.g., cotton fiber. Cellulose is
composed of crystalline beta-1,4-glucan microfibrils (see World Patent Publication
No. WO 98/00549). These microfibrils are strong, resist enzymatic and mechanical
degradation, and are important in determining the nutritional quality of animal and human
foodstuffs. Hence, modification of the biosynthetic pathway responsible for cellulose
synthesis through modification of cellulose synthase activity could potentially alter fiber
quantity, either by producing more or less fiber in a particular plant species or in a specific
organ or tissue of a particular plant. Modification of cellulose synthase activity could
increase the value of the fiber to the end-user and may improve the structural integrity of the
plant cell wall. Lastly, because cellulose is a major cell wall component, inhibition of
cellulose synthesis would probably be lethal. Thus, cellulose synthase may serve as the
target for a novel class of herbicides. Plant cellulose synthase genes, homologs of the
bacterial celA genes encoding the catalytic subunit of cellulose synthase, have been reported
from cotton, Arabidopsis, rice and alfalfa (World Patent Publication Nos. WO 98/00549 and
WO 98/18949).

There is a great deal of interest in identifying the genes that encode proteins involved 
in cellulose synthesis. These genes may be used in plant cells to control the synthesis of 
cellulose. Accordingly, the availability of nucleic acid sequences encoding all or a portion 
of a cellulose synthase would facilitate studies to better understand cellulose synthesis in 
plants and provide genetic tools to alter cellulose production. 

SUMMARY OF THE INVENTION 
The instant invention relates to isolated nucleic acid fragments encoding cellulose 
biosynthesis enzymes. Specifically, this invention concerns an isolated nucleic acid 
fragment encoding a cellulose synthase and an isolated nucleic acid fragment that is 
substantially similar to an isolated nucleic acid fragment encoding a cellulose synthase. In 
addition, this invention relates to a nucleic acid fragment that is complementary to the 
nucleic acid fragment encoding cellulose synthase. An additional embodiment of the instant 
invention pertains to a polypeptide encoding all or a substantial portion of a cellulose 
synthase.

In another embodiment, the instant invention relates to a chimeric gene encoding a
cellulose synthase, or to a chimeric gene that comprises a nucleic acid fragment that is
complementary to a nucleic acid fragment encoding a cellulose synthase, operably linked to
suitable regulatory sequences, wherein expression of the chimeric gene results in production
of levels of the encoded protein in a transformed host cell that are altered (i.e., increased or
decreased) from the level produced in an untransformed host cell.

In a further embodiment, the instant invention concerns a transformed host cell
comprising in its genome a chimeric gene encoding a cellulose synthase, operably linked to
suitable regulatory sequences. Expression of the chimeric gene results in production of
altered levels of the encoded protein in the transformed host cell. The transformed host cell
can be of eukaryotic or prokaryotic origin, and includes cells derived from higher plants and
microorganisms. The invention also includes transformed plants that arise from transformed
host cells of higher plants, and seeds derived from such transformed plants.

An additional embodiment of the instant invention concerns a method of altering the
level of expression of a cellulose synthase in a transformed host cell comprising:
a) transforming a host cell with a chimeric gene comprising a nucleic acid fragment
encoding a cellulose synthase; and
b) growing the transformed host cell under conditions that are suitable for expression of
the chimeric gene wherein expression of the chimeric gene results in production of altered
levels of cellulose synthase in the transformed host cell.

An additional embodiment of the instant invention concerns a method for obtaining a
nucleic acid fragment encoding all or a substantial portion of an amino acid sequence
encoding a cellulose synthase.

A further embodiment of the instant invention is a method for evaluating at least one
compound for its ability to inhibit the activity of a cellulose synthase, the method comprising
the steps of: (a) transforming a host cell with a chimeric gene comprising a nucleic acid
fragment encoding a cellulose synthase, operably linked to suitable regulatory sequences;
(b) growing the transformed host cell under conditions that are suitable for expression of the
chimeric gene wherein expression of the chimeric gene results in production of cellulose
synthase in the transformed host cell; (c) optionally purifying the cellulose synthase
expressed by the transformed host cell; (d) treating the cellulose synthase with a compound
to be tested; and (e) comparing the activity of the cellulose synthase that has been treated
with a test compound to the activity of an untreated cellulose synthase, thereby selecting
compounds with potential for inhibitory activity.

BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE DESCRIPTIONS

The invention can be more fully understood from the following detailed description 
and the accompanying drawings and Sequence Listing which form a part of this application.

Figure 1 shows a comparison of the amino acid sequences set forth in SEQ ID NOs:2,
4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thaliana sequences (SEQ ID NOs:23
(gi 2827139), 24 (gi 2827141), 26 (gi 4467125), 27 (gi 4886756) and 29 (gi 3135611)) and
Gossypium hirsutum sequences (SEQ ID NOs:25 (gi 1706958) and 28 (gi 5081779)).

Table 1 lists the polypeptides that are described herein, the designation of the cDNA
clones that comprise the nucleic acid fragments encoding polypeptides representing all or a
substantial portion of these polypeptides, and the corresponding identifier (SEQ ID NO:) as
used in the attached Sequence Listing. The sequence descriptions and Sequence Listing
attached hereto comply with the rules governing nucleotide and/or amino acid sequence
disclosures in patent applications as set forth in 37 C.F.R. §1.821-1.825.





TABLE 1

Cellulose Biosynthetic Enzymes

                                                SEQ ID NO:
Protein              Clone Designation          (Nucleotide)   (Amino Acid)

Cellulose Synthase   bshl.pk0002.f6             1              2

Cellulose Synthase   Contig composed of:                       4
                     ccoln.pk0005.g3
                     cdt2c.pk002.gl
                     cdt2c.pk002.116
                     csclc.pk002.il
                     p0031.ccmar05rb
                     p0110.cgsma57r

Cellulose Synthase   crln.pk0135.el0            5              6

Cellulose Synthase   p0097.cqradl7rc            7              8

Cellulose Synthase   p0122.ckamh70rc            9              10

Cellulose Synthase   rlr24.pk0073.gl            11             12

Cellulose Synthase   sdp2c.pk005.o22            13             14

Cellulose Synthase   ses8w.pk0028.G             15             16

Cellulose Synthase   ssl.pk0036.cl0             17             18

Cellulose Synthase   Contig composed of:        19             20
                     wll.pk0009.c9
                     wrl.pk0160.dll
                     wreln.pk0043.f9
                     wreln.pk0043.h8
                     wreln.pk0131.gl0

Cellulose Synthase   wlln.pk0044.bl             21             22



The Sequence Listing contains the one letter code for nucleotide sequence characters
and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB
standards described in Nucleic Acids Research 13:3021-3030 (1985) and in the Biochemical
Journal 219 (No. 2):345-373 (1984) which are herein incorporated by reference. The
symbols and format used for nucleotide and amino acid sequence data comply with the rules
set forth in 37 C.F.R. §1.822.

DETAILED DESCRIPTION OF THE INVENTION 
In the context of this disclosure, a number of terms shall be utilized. As used herein, a
"nucleic acid fragment" is a polymer of RNA or DNA that is single- or double-stranded,
optionally containing synthetic, non-natural or altered nucleotide bases. A nucleic acid
fragment in the form of a polymer of DNA may be comprised of one or more segments of
cDNA, genomic DNA or synthetic DNA.

As used herein, "contig" refers to a nucleotide sequence that is assembled from two or
more constituent nucleotide sequences that share common or overlapping regions of
sequence homology. For example, the nucleotide sequences of two or more nucleic acid
fragments can be compared and aligned in order to identify common or overlapping
sequences. Where common or overlapping sequences exist between two or more nucleic
acid fragments, the sequences (and thus their corresponding nucleic acid fragments) can be
assembled into a single contiguous nucleotide sequence.
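The assembly of overlapping sequences into a contig can be illustrated with a minimal sketch. It assumes exact (mismatch-free) overlaps between the end of one sequence and the start of the next, which is a simplification; the function name, the minimum overlap length and the example sequences are hypothetical and are not taken from the Sequence Listing.

```python
from typing import Optional


def merge_overlapping(left: str, right: str, min_overlap: int = 4) -> Optional[str]:
    """Join two sequences when a suffix of `left` exactly equals a prefix of `right`.

    Returns the merged contiguous sequence, or None when no exact overlap of at
    least `min_overlap` bases exists. Real assemblers tolerate mismatches; this
    illustrative sketch does not.
    """
    longest = min(len(left), len(right))
    # Try the longest candidate overlap first so the merge is as compact as possible.
    for size in range(longest, min_overlap - 1, -1):
        if left[-size:] == right[:size]:
            return left + right[size:]
    return None


# Hypothetical example reads sharing the five-base overlap "ACGGA".
contig = merge_overlapping("ATGGCTTACGGA", "ACGGATTTCCAG")
print(contig)
```

Searching from the longest overlap downward is a design choice that favors the most compact contig; a production assembler would instead score approximate alignments.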

As used herein, "substantially similar" refers to nucleic acid fragments wherein
changes in one or more nucleotide bases result in substitution of one or more amino acids,
but do not affect the functional properties of the polypeptide encoded by the nucleotide
sequence. "Substantially similar" also refers to nucleic acid fragments wherein changes in
one or more nucleotide bases do not affect the ability of the nucleic acid fragment to
mediate alteration of gene expression by gene silencing through, for example, antisense or
co-suppression technology. "Substantially similar" also refers to modifications of the nucleic
acid fragments of the instant invention such as deletion or insertion of one or more
nucleotides that do not substantially affect the functional properties of the resulting
transcript vis-a-vis the ability to mediate gene silencing or alteration of the functional
properties of the resulting protein molecule. It is therefore understood that the invention
encompasses more than the specific exemplary nucleotide or amino acid sequences and
includes functional equivalents thereof.

For example, it is well known in the art that antisense suppression and co-suppression
of gene expression may be accomplished using nucleic acid fragments representing less than
the entire coding region of a gene, and by nucleic acid fragments that do not share 100%
sequence identity with the gene to be suppressed. Moreover, alterations in a nucleic acid
fragment which result in the production of a chemically equivalent amino acid at a given
site, but do not affect the functional properties of the encoded polypeptide, are well known in
the art. Thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be
substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result
in substitution of one negatively charged residue for another, such as aspartic acid for
glutamic acid, or one positively charged residue for another, such as lysine for arginine, can
also be expected to produce a functionally equivalent product. Nucleotide changes which
result in alteration of the N-terminal and C-terminal portions of the polypeptide molecule
would also not be expected to alter the activity of the polypeptide. Each of the proposed
modifications is well within the routine skill in the art, as is determination of retention of
biological activity of the encoded products.

Moreover, substantially similar nucleic acid fragments may also be characterized by 
their ability to hybridize, under stringent conditions (0.1X SSC, 0.1% SDS, 65°C), with the 
nucleic acid fragments disclosed herein. 

Substantially similar nucleic acid fragments of the instant invention may also be
characterized by the percent identity of the amino acid sequences that they encode to the
amino acid sequences disclosed herein, as determined by algorithms commonly employed by
those skilled in this art. Preferred are those nucleic acid fragments whose nucleotide
sequences encode amino acid sequences that are 80% identical to the amino acid sequences
reported herein. More preferred nucleic acid fragments encode amino acid sequences that
are 90% identical to the amino acid sequences reported herein. Most preferred are nucleic
acid fragments that encode amino acid sequences that are 95% identical to the amino acid
sequences reported herein. Sequence alignments and percent identity calculations were
performed using the Megalign program of the LASARGENE bioinformatics computing suite
(DNASTAR Inc., Madison, WI). Multiple alignment of the sequences was performed using
the Clustal method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the
default parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default
parameters for pairwise alignments using the Clustal method were KTUPLE=1, GAP
PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5.
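The identity thresholds above can be illustrated with a small sketch. It assumes two amino acid sequences that have already been aligned (gaps written as "-") and counts identical residues over the columns in which neither sequence has a gap. This is one common convention and is not claimed to be the calculation performed by Megalign. The function names, the example sequences and the reading of each threshold as "at least" are assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == "-" or residue_b == "-":
            continue  # gapped columns are excluded under this convention
        compared += 1
        if residue_a == residue_b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def preference_tier(identity: float) -> str:
    """Map an identity value onto the 80%, 90% and 95% tiers named in the text."""
    if identity >= 95.0:
        return "most preferred"
    if identity >= 90.0:
        return "more preferred"
    if identity >= 80.0:
        return "preferred"
    return "below the stated thresholds"


# Hypothetical aligned peptides: 6 identical residues out of 7 ungapped columns.
identity = percent_identity("MKT-LLVA", "MKTALIVA")
print(round(identity, 1), preference_tier(identity))
```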

A "substantial portion" of an amino acid or nucleotide sequence comprises an amino
acid or a nucleotide sequence that is sufficient to afford putative identification of the protein
or gene that the amino acid or nucleotide sequence comprises. Amino acid and nucleotide
sequences can be evaluated either manually by one skilled in the art, or by using
computer-based sequence comparison and identification tools that employ algorithms such as
BLAST (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol. Biol. 215:403-410; see
also www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence of ten or more contiguous
amino acids or thirty or more contiguous nucleotides is necessary in order to putatively
identify a polypeptide or nucleic acid sequence as homologous to a known protein or gene.
Moreover, with respect to nucleotide sequences, gene-specific oligonucleotide probes
comprising 30 or more contiguous nucleotides may be used in sequence-dependent methods
of gene identification (e.g., Southern hybridization) and isolation (e.g., in situ hybridization
of bacterial colonies or bacteriophage plaques). In addition, short oligonucleotides of 12 or
more nucleotides may be used as amplification primers in PCR in order to obtain a particular
nucleic acid fragment comprising the primers. Accordingly, a "substantial portion" of a
nucleotide sequence comprises a nucleotide sequence that will afford specific identification
and/or isolation of a nucleic acid fragment comprising the sequence. The instant
specification teaches amino acid and nucleotide sequences encoding polypeptides that
comprise one or more particular plant proteins. The skilled artisan, having the benefit of the
sequences as reported herein, may now use all or a substantial portion of the disclosed
sequences for purposes known to those skilled in this art. Accordingly, the instant invention
comprises the complete sequences as reported in the accompanying Sequence Listing, as
well as substantial portions of those sequences as defined above.

"Codon degeneracy" refers to divergence in the genetic code permitting variation of
the nucleotide sequence without affecting the amino acid sequence of an encoded
polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment
comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid
sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited
by a specific host cell in usage of nucleotide codons to specify a given amino acid.
Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell,
it is desirable to design the nucleic acid fragment such that its frequency of codon usage
approaches the frequency of preferred codon usage of the host cell.

"Synthetic nucleic acid fragments" can be assembled from oligonucleotide building
blocks that are chemically synthesized using procedures known to those skilled in the art.
These building blocks are ligated and annealed to form larger nucleic acid fragments which
may then be enzymatically assembled to construct the entire desired nucleic acid fragment.
"Chemically synthesized", as related to nucleic acid fragment, means that the component
nucleotides were assembled in vitro. Manual chemical synthesis of nucleic acid fragments
may be accomplished using well established procedures, or automated chemical synthesis
can be performed using one of a number of commercially available machines. Accordingly,
the nucleic acid fragments can be tailored for optimal gene expression based on optimization
of nucleotide sequence to reflect the codon bias of the host cell. The skilled artisan
appreciates the likelihood of successful gene expression if codon usage is biased towards
those codons favored by the host. Determination of preferred codons can be based on a
survey of genes derived from the host cell where sequence information is available.
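A survey of host genes for preferred codons, followed by recoding of a polypeptide, might be sketched as follows. This is an illustrative simplification: it assumes the standard genetic code, uppercase in-frame DNA coding sequences, and a "most frequent codon wins" rule, none of which is prescribed by the text. The function names and the example host sequence are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def preferred_codons(host_coding_sequences):
    """Return, for each amino acid, the codon seen most often in the host genes."""
    counts = Counter()
    for cds in host_coding_sequences:
        usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
        for start in range(0, usable, 3):
            counts[cds[start:start + 3]] += 1
    best = {}
    for codon, amino_acid in CODON_TABLE.items():
        # Ties are resolved in favor of the codon listed first in the table.
        if amino_acid not in best or counts[codon] > counts[best[amino_acid]]:
            best[amino_acid] = codon
    return best


def recode(protein: str, best) -> str:
    """Back-translate a one-letter protein sequence using the preferred codons."""
    return "".join(best[amino_acid] for amino_acid in protein)


# Hypothetical host gene: GCC is used twice for Ala, AAG twice for Lys.
host_genes = ["ATGGCCGCCGCTAAGAAGAAATAA"]
print(recode("MAK", preferred_codons(host_genes)))
```

Always choosing the single most frequent codon is the simplest possible rule; matching the host's overall codon frequency distribution, as the text describes, would require sampling codons in proportion to their observed usage.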

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its
own regulatory sequences. "Chimeric gene" refers to any gene that is not a native gene,
comprising regulatory and coding sequences that are not found together in nature.
Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that
are derived from different sources, or regulatory sequences and coding sequences derived
from the same source, but arranged in a manner different than that found in nature.
"Endogenous gene" refers to a native gene in its natural location in the genome of an
organism. A "foreign" gene refers to a gene not normally found in the host organism, but
that is introduced into the host organism by gene transfer. Foreign genes can comprise
native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene
that has been introduced into the genome by a transformation procedure.

"Coding sequence" refers to a nucleotide sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5'
non-coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence,
and which influence the transcription, RNA processing or stability, or translation of the
associated coding sequence. Regulatory sequences may include promoters, translation
leader sequences, introns, and polyadenylation recognition sequences.

"Promoter" refers to a nucleotide sequence capable of controlling the expression of a 
coding sequence or functional RNA. In general, a coding sequence is located 3' to a
promoter sequence. The promoter sequence consists of proximal and more distal upstream
elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a
nucleotide sequence which can stimulate promoter activity and may be an innate element of
the promoter or a heterologous element inserted to enhance the level or tissue-specificity of
a promoter. Promoters may be derived in their entirety from a native gene, or be composed
of different elements derived from different promoters found in nature, or even comprise
synthetic nucleotide segments. It is understood by those skilled in the art that different
promoters may direct the expression of a gene in different tissues or cell types, or at
different stages of development, or in response to different environmental conditions.
Promoters which cause a nucleic acid fragment to be expressed in most cell types at most
times are commonly referred to as "constitutive promoters". New promoters of various
types useful in plant cells are constantly being discovered; numerous examples may be
found in the compilation by Okamuro and Goldberg (1989) Biochemistry of Plants 15:1-82.
It is further recognized that since in most cases the exact boundaries of regulatory sequences
have not been completely defined, nucleic acid fragments of different lengths may have
identical promoter activity.

The "translation leader sequence" refers to a nucleotide sequence located between the
promoter sequence of a gene and the coding sequence. The translation leader sequence is
present in the fully processed mRNA upstream of the translation start sequence. The
translation leader sequence may affect processing of the primary transcript to mRNA,
mRNA stability or translation efficiency. Examples of translation leader sequences have
been described (Turner and Foster (1995) Molecular Biotechnology 3:225).

The "3' non-coding sequences" refer to nucleotide sequences located downstream of a
coding sequence and include polyadenylation recognition sequences and other sequences
encoding regulatory signals capable of affecting mRNA processing or gene expression. The
polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid
tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is
exemplified by Ingelbrecht et al. (1989) Plant Cell 1:671-680.

"RNA transcript" refers to the product resulting from RNA polymerase-catalyzed
transcription of a DNA sequence. When the RNA transcript is a perfect complementary
copy of the DNA sequence, it is referred to as the primary transcript or it may be an RNA
sequence derived from posttranscriptional processing of the primary transcript and is
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is
without introns and that can be translated into polypeptide by the cell. "cDNA" refers to a
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA
refers to an RNA transcript that includes the mRNA and so can be translated into a
polypeptide by the cell. "Antisense RNA" refers to an RNA transcript that is
complementary to all or part of a target primary transcript or mRNA and that blocks the
expression of a target gene (see U.S. Patent No. 5,107,065, incorporated herein by
reference). The complementarity of an antisense RNA may be with any part of the specific
nucleotide sequence, i.e., at the 5' non-coding sequence, 3' non-coding sequence, introns, or
the coding sequence. "Functional RNA" refers to sense RNA, antisense RNA, ribozyme
RNA, or other RNA that may not be translated but yet has an effect on cellular processes.
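The complementarity between an antisense RNA and its target can be made concrete with a short sketch. It assumes standard Watson-Crick pairing (A with U, G with C), an uppercase RNA sequence written 5' to 3', and full-length perfect complementarity; the function name and the example sequence are hypothetical.

```python
# Watson-Crick pairing for RNA bases.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(target_rna: str) -> str:
    """Return the RNA strand complementary to `target_rna`, written 5' to 3'.

    Each base is complemented, then the strand is reversed because paired
    strands run antiparallel.
    """
    return target_rna.translate(RNA_COMPLEMENT)[::-1]


# Hypothetical six-base target segment.
target = "AUGGCU"
print(antisense_rna(target))
```

Applying the function twice returns the original sequence, which is a convenient sanity check on the pairing table.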

The term "operably linked" refers to the association of two or more nucleic acid
fragments on a single nucleic acid fragment so that the function of one is affected by the
other. For example, a promoter is operably linked with a coding sequence when it is capable
of affecting the expression of that coding sequence (i.e., that the coding sequence is under
the transcriptional control of the promoter). Coding sequences can be operably linked to
regulatory sequences in sense or antisense orientation.

The term "expression", as used herein, refers to the transcription and stable
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of
the invention. Expression may also refer to translation of mRNA into a polypeptide.
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of
suppressing the expression of the target protein. "Overexpression" refers to the production
of a gene product in transgenic organisms that exceeds levels of production in normal or
non-transformed organisms. "Co-suppression" refers to the production of sense RNA
transcripts capable of suppressing the expression of identical or substantially similar foreign
or endogenous genes (U.S. Patent No. 5,231,020, incorporated herein by reference).

"Altered levels" refers to the production of gene product(s) in transgenic organisms in
amounts or proportions that differ from that of normal or non-transformed organisms.

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 



WO 00/04166 



PCT/US99/15871



"Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and
propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in
conjunction with a protein and directs the protein to the chloroplast or other plastid types
present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a
nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an
amino acid sequence which is translated in conjunction with a protein and directs the protein
to the secretory system (Chrispeels (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42:21-53).
If the protein is to be directed to a vacuole, a vacuolar targeting signal (supra) can further be
added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal (supra)
may be added. If the protein is to be directed to the nucleus, any signal peptide present
should be removed and instead a nuclear localization signal included (Raikhel (1992) Plant
Phys. 100:1627-1632).

"Transformation" refers to the transfer of a nucleic acid fragment into the genome of a
host organism, resulting in genetically stable inheritance. Host organisms containing the
transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of
methods of plant transformation include Agrobacterium-mediated transformation (De Blaere
et al. (1987) Meth. Enzymol. 143:277) and particle-accelerated or "gene gun" transformation
technology (Klein et al. (1987) Nature (London) 327:70-73; U.S. Patent No. 4,945,050,
incorporated herein by reference).

Standard recombinant DNA and molecular cloning techniques used herein are well
known in the art and are described more fully in Sambrook et al. Molecular Cloning: A
Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989
(hereinafter "Maniatis").

Nucleic acid fragments encoding at least a portion of a cellulose synthase enzyme have
been isolated and identified by comparison of random plant cDNA sequences to public
databases containing nucleotide and protein sequences using the BLAST algorithms well
known to those skilled in the art. The nucleic acid fragments of the instant invention may be
used to isolate cDNAs and genes encoding homologous proteins from the same or other
plant species. Isolation of homologous genes using sequence-dependent protocols is well
known in the art. Examples of sequence-dependent protocols include, but are not limited to,
methods of nucleic acid hybridization, and methods of DNA and RNA amplification as
exemplified by various uses of nucleic acid amplification technologies (e.g., polymerase
chain reaction, ligase chain reaction).

For example, genes encoding other cellulose synthase enzymes, either as cDNAs or
genomic DNAs, could be isolated directly by using all or a portion of the instant nucleic acid
fragments as DNA hybridization probes to screen libraries from any desired plant employing
methodology well known to those skilled in the art. Specific oligonucleotide probes based
upon the instant nucleic acid sequences can be designed and synthesized by methods known
in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize
DNA probes by methods known to the skilled artisan such as random primer DNA labeling,
nick translation, or end-labeling techniques, or RNA probes using available in vitro
transcription systems. In addition, specific primers can be designed and used to amplify a
part or all of the instant sequences. The resulting amplification products can be labeled
directly during amplification reactions or labeled after amplification reactions, and used as
probes to isolate full length cDNA or genomic fragments under conditions of appropriate
stringency.

In addition, two short segments of the instant nucleic acid fragments may be used in
polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding
homologous genes from DNA or RNA. The polymerase chain reaction may also be
performed on a library of cloned nucleic acid fragments wherein the sequence of one primer
is derived from the instant nucleic acid fragments, and the sequence of the other primer takes
advantage of the presence of the polyadenylic acid tracts to the 3' end of the mRNA
precursor encoding plant genes. Alternatively, the second primer sequence may be based
upon sequences derived from the cloning vector. For example, the skilled artisan can follow
the RACE protocol (Frohman et al. (1988) Proc. Natl. Acad. Sci. USA 85:8998) to generate
cDNAs by using PCR to amplify copies of the region between a single point in the
transcript and the 3' or 5' end. Primers oriented in the 3' and 5' directions can be designed
from the instant sequences. Using commercially available 3' RACE or 5' RACE systems
(BRL), specific 3' or 5' cDNA fragments can be isolated (Ohara et al. (1989) Proc. Natl.
Acad. Sci. USA 86:5673; Loh et al. (1989) Science 243:217). Products generated by the 3'
and 5' RACE procedures can be combined to generate full-length cDNAs (Frohman and
Martin (1989) Techniques 1:165).

Availability of the instant nucleotide and deduced amino acid sequences facilitates
immunological screening of cDNA expression libraries. Synthetic peptides representing
portions of the instant amino acid sequences may be synthesized. These peptides can be
used to immunize animals to produce polyclonal or monoclonal antibodies with specificity
for peptides or proteins comprising the amino acid sequences. These antibodies can then
be used to screen cDNA expression libraries to isolate full-length cDNA clones of interest
(Lerner (1984) Adv. Immunol. 36:1; Maniatis).

The nucleic acid fragments of the instant invention may be used to create transgenic
plants in which the disclosed polypeptides are present at higher or lower levels than normal
or in cell types or developmental stages in which they are not normally found. This would
have the effect of altering the level of cellulose synthase in those cells.

Overexpression of the proteins of the instant invention may be accomplished by first
constructing a chimeric gene in which the coding region is operably linked to a promoter
capable of directing expression of a gene in the desired tissues at the desired stage of
development. For reasons of convenience, the chimeric gene may comprise promoter
sequences and translation leader sequences derived from the same genes. 3' Non-coding
sequences encoding transcription termination signals may also be provided. The instant
chimeric gene may also comprise one or more introns in order to facilitate gene expression.

Plasmid vectors comprising the instant chimeric gene can then be constructed. The
choice of plasmid vector is dependent upon the method that will be used to transform host
plants. The skilled artisan is well aware of the genetic elements that must be present on the
plasmid vector in order to successfully transform, select and propagate host cells containing
the chimeric gene. The skilled artisan will also recognize that different independent
transformation events will result in different levels and patterns of expression (Jones et al.
(1985) EMBO J. 4:2411-2418; De Almeida et al. (1989) Mol. Gen. Genetics 218:78-86), and
thus that multiple events must be screened in order to obtain lines displaying the desired
expression level and pattern. Such screening may be accomplished by Southern analysis of
DNA, Northern analysis of mRNA expression, Western analysis of protein expression, or
phenotypic analysis.

For some applications it may be useful to direct the instant polypeptides to different
cellular compartments, or to facilitate their secretion from the cell. It is thus envisioned that
the chimeric gene described above may be further supplemented by altering the coding
sequence to encode the instant polypeptides with appropriate intracellular targeting
sequences such as transit sequences (Keegstra (1989) Cell 56:247-253), signal sequences or
sequences encoding endoplasmic reticulum localization (Chrispeels (1991) Ann. Rev. Plant
Phys. Plant Mol. Biol. 42:21-53), or nuclear localization signals (Raikhel (1992) Plant
Phys. 100:1627-1632) added and/or with targeting sequences that are already present
removed. While the references cited give examples of each of these, the list is not 
exhaustive and more targeting signals of utility may be discovered in the future. 

It may also be desirable to reduce or eliminate expression of genes encoding the
instant polypeptides in plants for some applications. In order to accomplish this, a chimeric
gene designed for co-suppression of the instant polypeptide can be constructed by linking a
gene or gene fragment encoding that polypeptide to plant promoter sequences.
Alternatively, a chimeric gene designed to express antisense RNA for all or part of the
instant nucleic acid fragment can be constructed by linking the gene or gene fragment in
reverse orientation to plant promoter sequences. Either the co-suppression or antisense
chimeric genes could be introduced into plants via transformation wherein expression of the
corresponding endogenous genes is reduced or eliminated.
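
The effect of placing a fragment in reverse orientation can be illustrated with a short sketch. It assumes a simplified model in which the transcript is read directly from the top strand of the insert downstream of the promoter; the function names and the toy fragment are hypothetical and do not correspond to any sequence of the instant invention.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def transcript(insert_top_strand: str) -> str:
    """RNA produced when the top strand of the insert has the same sense as the RNA."""
    return insert_top_strand.upper().replace("T", "U")


# Toy coding-strand fragment, invented for illustration only.
sense_fragment = "ATGGCCTTCAAG"

# Cloning the fragment in reverse orientation places its reverse complement
# on the top strand downstream of the promoter.
antisense_insert = reverse_complement(sense_fragment)
antisense_rna = transcript(antisense_insert)

print(antisense_insert)
print(antisense_rna)
```

Under this model the antisense RNA is complementary, base for base, to the mRNA of the endogenous gene over the length of the fragment.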

Molecular genetic solutions to the generation of plants with altered gene expression
have a decided advantage over more traditional plant breeding approaches. Changes in plant
phenotypes can be produced by specifically inhibiting expression of one or more genes by
antisense inhibition or cosuppression (U.S. Patent Nos. 5,190,931, 5,107,065 and
5,283,323). An antisense or cosuppression construct would act as a dominant negative
regulator of gene activity. While conventional mutations can yield negative regulation of
gene activity, these effects are most likely recessive. The dominant negative regulation
available with a transgenic approach may be advantageous from a breeding perspective. In
addition, the ability to restrict the expression of a specific phenotype to the reproductive
tissues of the plant by the use of tissue-specific promoters may confer agronomic
advantages relative to conventional mutations, which may have an effect in all tissues in
which a mutant gene is ordinarily expressed.

The person skilled in the art will know that special considerations are associated with
the use of antisense or cosuppression technologies in order to reduce expression of particular
genes. For example, the proper level of expression of sense or antisense genes may require
the use of different chimeric genes utilizing different regulatory elements known to the
skilled artisan. Once transgenic plants are obtained by one of the methods described above,
it will be necessary to screen individual transgenics for those that most effectively display
the desired phenotype. Accordingly, the skilled artisan will develop methods for screening
large numbers of transformants. The nature of these screens will generally be chosen on
practical grounds, and is not an inherent part of the invention. For example, one can screen
by looking for changes in gene expression by using antibodies specific for the protein
encoded by the gene being suppressed, or one could establish assays that specifically
measure enzyme activity. A preferred method will be one which allows large numbers of
samples to be processed rapidly, since it will be expected that a large number of
transformants will be negative for the desired phenotype.

The instant polypeptides (or portions thereof) may be produced in heterologous host
cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies to
these proteins by methods well known to those skilled in the art. The antibodies are useful
for detecting the polypeptides of the instant invention in situ in cells or in vitro in cell
extracts. Preferred heterologous host cells for production of the instant polypeptides are
microbial hosts. Microbial expression systems and expression vectors containing regulatory
sequences that direct high level expression of foreign proteins are well known to those
skilled in the art. Any of these could be used to construct a chimeric gene for production of
the instant polypeptides. This chimeric gene could then be introduced into appropriate
microorganisms via transformation to provide high level expression of the encoded cellulose
synthase. An example of a vector for high level expression of the instant polypeptides in a
bacterial host is provided (Example 6).

Additionally, the instant polypeptides can be used as targets to facilitate design
and/or identification of inhibitors of those enzymes that may be useful as herbicides. This is
desirable because the polypeptides described herein catalyze a step in the synthesis of
cellulose. Accordingly, inhibition of the activity of one or more of the enzymes described
herein could lead to inhibition of plant growth. Thus, the instant polypeptides could be
appropriate for new herbicide discovery and design.

All or a substantial portion of the nucleic acid fragments of the instant invention may
also be used as probes for genetically and physically mapping the genes that they are a part
of, and as markers for traits linked to those genes. Such information may be useful in plant
breeding in order to develop lines with desired phenotypes. For example, the instant nucleic
acid fragments may be used as restriction fragment length polymorphism (RFLP) markers.
Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with
the nucleic acid fragments of the instant invention. The resulting banding patterns may then
be subjected to genetic analyses using computer programs such as MapMaker (Lander et al.
(1987) Genomics 1:174-181) in order to construct a genetic map. In addition, the nucleic
acid fragments of the instant invention may be used to probe Southern blots containing
restriction endonuclease-treated genomic DNAs of a set of individuals representing parent
and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted
and used to calculate the position of the instant nucleic acid sequence in the genetic map
previously obtained using this population (Botstein et al. (1980) Am. J. Hum. Genet.
32:314-331).
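
As a simplified illustration of how segregation data yield a map position, the sketch below computes a two-point recombination fraction between an RFLP marker and a previously mapped locus, and converts it to a map distance with the standard Haldane and Kosambi mapping functions. It assumes backcross-style scoring in which each progeny carries one of two genotype codes at each locus; the genotype strings are invented, and programs such as MapMaker use multipoint maximum-likelihood methods rather than this direct count.

```python
import math


def recombination_fraction(locus_a: str, locus_b: str) -> float:
    """Fraction of progeny whose genotype code differs between two loci."""
    if len(locus_a) != len(locus_b) or not locus_a:
        raise ValueError("both loci must be scored on the same, non-empty set of progeny")
    recombinants = sum(1 for x, y in zip(locus_a, locus_b) if x != y)
    return recombinants / len(locus_a)


def haldane_cm(r: float) -> float:
    """Haldane map distance in centimorgans (assumes no crossover interference)."""
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must satisfy 0 <= r < 0.5")
    return -50.0 * math.log(1.0 - 2.0 * r)


def kosambi_cm(r: float) -> float:
    """Kosambi map distance in centimorgans (allows for interference)."""
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must satisfy 0 <= r < 0.5")
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))


# Invented scores for ten backcross progeny: A = parental allele, H = heterozygote.
rflp_marker = "AAHHAHAHHA"
mapped_locus = "AAHHAHAHAA"

r = recombination_fraction(rflp_marker, mapped_locus)
print(r, round(haldane_cm(r), 2), round(kosambi_cm(r), 2))
```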

The production and use of plant gene-derived probes for use in genetic mapping is
described in Bernatzky and Tanksley (1986) Plant Mol. Biol. Reporter 4(1):37-41.
Numerous publications describe genetic mapping of specific cDNA clones using the
methodology outlined above or variations thereof. For example, F2 intercross populations,
backcross populations, randomly mated populations, near isogenic lines, and other sets of 
individuals may be used for mapping. Such methodologies are well known to those skilled 
in the art. 

Nucleic acid probes derived from the instant nucleic acid sequences may also be used
for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel et al. In:
Nonmammalian Genomic Analysis: A Practical Guide, Academic Press 1996, pp. 319-346,
and references cited therein).

In another embodiment, nucleic acid probes derived from the instant nucleic acid
sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask
(1991) Trends Genet. 7:149-154). Although current methods of FISH mapping favor use of
large clones (several to several hundred kb; see Laan et al. (1995) Genome Research
5:13-20), improvements in sensitivity may allow performance of FISH mapping using
shorter probes.

A variety of nucleic acid amplification-based methods of genetic and physical
mapping may be carried out using the instant nucleic acid sequences. Examples include
allele-specific amplification (Kazazian (1989) J. Lab. Clin. Med. 114:95-96),
polymorphism of PCR-amplified fragments (CAPS; Sheffield et al. (1993) Genomics
16:325-332), allele-specific ligation (Landegren et al. (1988) Science 241:1077-1080),
nucleotide extension reactions (Sokolov (1990) Nucleic Acid Res. 18:3671), Radiation
Hybrid Mapping (Walter et al. (1997) Nature Genetics 7:22-28) and Happy Mapping (Dear
and Cook (1989) Nucleic Acid Res. 17:6795-6807). For these methods, the sequence of a
nucleic acid fragment is used to design and produce primer pairs for use in the amplification
reaction or in primer extension reactions. The design of such primers is well known to those
skilled in the art. In methods employing PCR-based genetic mapping, it may be necessary
to identify DNA sequence differences between the parents of the mapping cross in the
region corresponding to the instant nucleic acid sequence. This, however, is generally not
necessary for mapping methods.

Loss of function mutant phenotypes may be identified for the instant cDNA clones
either by targeted gene disruption protocols or by identifying specific mutants for these
genes contained in a maize population carrying mutations in all possible genes (Ballinger
and Benzer (1989) Proc. Natl. Acad. Sci USA 86:9402; Koes et al. (1995) Proc. Natl. Acad.
Sci USA 92:8149; Bensen et al. (1995) Plant Cell 7:75). The latter approach may be
accomplished in two ways. First, short segments of the instant nucleic acid fragments may
be used in polymerase chain reaction protocols in conjunction with a mutation tag sequence
primer on DNAs prepared from a population of plants in which Mutator transposons or some
other mutation-causing DNA element has been introduced (see Bensen, supra). The
amplification of a specific DNA fragment with these primers indicates the insertion of the
mutation tag element in or near the plant gene encoding the instant polypeptides.
Alternatively, the instant nucleic acid fragment may be used as a hybridization probe against
PCR amplification products generated from the mutation population using the mutation tag
sequence primer in conjunction with an arbitrary genomic site primer, such as that for a
restriction enzyme site-anchored synthetic adaptor. With either method, a plant containing a
mutation in the endogenous gene encoding the instant polypeptides can be identified and
obtained. This mutant plant can then be used to determine or confirm the natural function of
the instant polypeptides disclosed herein.

EXAMPLES 

The present invention is further defined in the following Examples, in which all parts
and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be
understood that these Examples, while indicating preferred embodiments of the invention,
are given by way of illustration only. From the above discussion and these Examples, one
skilled in the art can ascertain the essential characteristics of this invention, and without
departing from the spirit and scope thereof, can make various changes and modifications of
the invention to adapt it to various usages and conditions.

EXAMPLE 1 

Composition of cDNA Libraries: Isolation and Sequencing of cDNA Clones 
cDNA libraries representing mRNAs from various barley, corn, rice, soybean and 
wheat tissues were prepared. The characteristics of the libraries are described below. 

TABLE 2 

cDNA Libraries from Barley, Corn, Rice, Soybean and Wheat 



Library  Tissue                                                      Clone

bsh1     Barley (Hordeum vulgare) sheath, developing seedling        bsh1.pk0002.f6

cco1n    Corn (Zea mays) cob of 67 day old plants grown in green     cco1n.pk0005.g3
         house*

cdt2c    Corn (Zea mays) developing tassel 2                         cdt2c.pk002.g1
                                                                     cdt2c.pk002.116

cr1n     Corn (Zea mays) root from 7 day seedlings grown in light*   cr1n.pk0135.e10

csc1c    Corn (Zea mays) 20 day seedling (germination under cold     csc1c.pk002.i1
         stress)

p0031    Corn (Zea mays) shoot culture, initiated from seed derived  p0031.ccmar05rb
         meristems; culture was maintained on 273N medium

p0110    Corn (Zea mays) stages V3/V4** leaf tissue minus midrib     p0110.cgsma57r
         harvested 4 hours, 24 hours and 7 days after infiltration
         with salicylic acid, tissues pooled*

p0097    Corn (Zea mays) stage V9** whorl section (7 cm) from        p0097.cqrad17rc
         plant infected four times with European corn borer

p0122    Corn (Zea mays) pith tissue collected from internode        p0122.ckamh70rc
         subtending ear node 5 days after pollination

rlr24    Rice (Oryza sativa) leaf (15 days after germination)        rlr24.pk0073.g1
         24 hours after infection of Magnaporthe grisea strain
         4360-R-62 (AVR2-YAMO); Resistant

sdp2c    Soybean (Glycine max) developing pods 6-7 mm                sdp2c.pk005.o22

ses8w    Soybean (Glycine max) mature embryo 8 weeks after           ses8w.pk0028.f3
         subculture

ss1      Soybean (Glycine max) seedling 5-10 day                     ss1.pk0036.c10

wl1      Wheat (Triticum aestivum) leaf 7 day old seedling, light    wl1.pk0009.c9
         grown

wl1n     Wheat (Triticum aestivum) leaf 7 day old seedling, light    wl1n.pk0044.b1
         grown*

wr1      Wheat (Triticum aestivum) root; 7 day old seedling, light   wr1.pk0160.d11
         grown

wre1n    Wheat (Triticum aestivum) root; 7 day old etiolated         wre1n.pk0043.f9
         seedling*                                                   wre1n.pk0043.h8
                                                                     wre1n.pk0131.g10



*These libraries were normalized essentially as described in U.S. Patent No. 5,482,845, 
incorporated herein by reference. 

**V3, V4 and V9 refer to stages of corn growth. The descriptions can be found in "How
a Corn Plant Develops" Special Report No. 48, Iowa State University of Science and
Technology Cooperative Extension Service, Ames, Iowa, Reprinted February 1993.

cDNA libraries may be prepared by any one of many methods available. For
example, the cDNAs may be introduced into plasmid vectors by first preparing the cDNA
libraries in Uni-ZAP™ XR vectors according to the manufacturer's protocol (Stratagene
Cloning Systems, La Jolla, CA). The Uni-ZAP™ XR libraries are converted into plasmid
libraries according to the protocol provided by Stratagene. Upon conversion, cDNA inserts
will be contained in the plasmid vector pBluescript. In addition, the cDNAs may be
introduced directly into precut Bluescript II SK(+) vectors (Stratagene) using T4 DNA
ligase (New England Biolabs), followed by transfection into DH10B cells according to the
manufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts are in plasmid
vectors, plasmid DNAs are prepared from randomly picked bacterial colonies containing
recombinant pBluescript plasmids, or the insert cDNA sequences are amplified via
polymerase chain reaction using primers specific for vector sequences flanking the inserted
cDNA sequences. Amplified insert DNAs or plasmid DNAs are sequenced in dye-primer
sequencing reactions to generate partial cDNA sequences (expressed sequence tags or
"ESTs"; see Adams et al. (1991) Science 252:1651). The resulting ESTs are analyzed using
a Perkin Elmer Model 377 fluorescent sequencer.

EXAMPLE 2
Identification of cDNA Clones



cDNA clones encoding cellulose synthase enzymes were identified by conducting
BLAST (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol. Biol.
215:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences
contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS
translations, sequences derived from the 3-dimensional structure Brookhaven Protein Data
Bank, the last major release of the SWISS-PROT protein sequence database, EMBL, and
DDBJ databases). The cDNA sequences obtained in Example 1 were analyzed for similarity
to all publicly available DNA sequences contained in the "nr" database using the BLASTN
algorithm provided by the National Center for Biotechnology Information (NCBI). The
DNA sequences were translated in all reading frames and compared for similarity to all
publicly available protein sequences contained in the "nr" database using the BLASTX
algorithm (Gish and States (1993) Nature Genetics 3:266-272) provided by the NCBI. For
convenience, the P-value (probability) of observing a match of a cDNA sequence to a
sequence contained in the searched databases merely by chance, as calculated by BLAST, is
reported herein as a "pLog" value, which represents the negative of the logarithm of the
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the
cDNA sequence and the BLAST "hit" represent homologous proteins.
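
The pLog conversion can be written as a one-line calculation. The sketch below assumes a base-10 logarithm, which the text does not state explicitly; the function name plog and the example P-values are hypothetical and are chosen only for illustration.

```python
import math


def plog(p_value: float) -> float:
    """Negative base-10 logarithm of a BLAST P-value (base 10 is an assumption)."""
    if not 0.0 < p_value <= 1.0:
        raise ValueError("P-value must be greater than 0 and no greater than 1")
    return -math.log10(p_value)


# Illustrative P-values only; they are not taken from the searches described above.
for p in (1e-154, 2e-78, 1e-5):
    print(p, round(plog(p), 2))
```

A P-value reported as zero has no finite pLog under this definition, so such a match can only be described as exceeding some limiting value.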

EXAMPLE 3 

Characterization of cDNA Clones Encoding Cellulose Synthase

The BLASTX search using the EST sequences from clones listed in Table 3 revealed
similarity of the polypeptides encoded by the cDNAs to cellulose synthase from Arabidopsis
thaliana (NCBI Identifier Nos. gi 2827139, gi 2827141, gi 4467125, gi 4886756 and
gi 3135611) and Gossypium hirsutum (NCBI Identifier Nos. gi 1706958 and gi 5081779). Shown
in Table 3 are the BLAST results for individual ESTs ("EST"), the sequences of the entire
cDNA inserts comprising the indicated cDNA clones ("FIS"), complete gene sequences
("CGS") or contigs assembled from two or more ESTs ("Contig"):



TABLE 3 

BLAST Results for Sequences Encoding Polypeptides Homologous 
to Arabidopsis thaliana and Gossypium hirsutum Cellulose Synthase 



Clone                     Status    BLAST pLog Score

bsh1.pk0002.f6            FIS       154.00 (gi 2827139)

Contig composed of:       Contig    >254.00 (gi 2827141)
  cco1n.pk0005.g3
  cdt2c.pk002.e1
  cdt2c.pk002.n6
  csc1c.pk002.i1
  p0031.ccmar05rb
  p0110.cgsma57r

cr1n.pk0135.e10           FIS       176.00 (gi 1706958)

p0097.cqrad17rc           CGS       >254.00 (gi 2827141)

p0122.ckamh70rc           CGS       >254.00 (gi 2827141)

rlr24.pk0073.g1           EST       77.70 (gi 4467125)

sdp2c.pk005.o22           FIS       >254.00 (gi 4886756)

ses8w.pk0028.f3           EST       >254.00 (gi 2827139)

ss1.pk0036.c10            EST       >254.00 (gi 2827141)

Contig composed of:       Contig    >254.00 (gi 5081779)
  wl1.pk0009.c9
  wr1.pk0160.d11
  wre1n.pk0043.f9
  wre1n.pk0043.h8
  wre1n.pk0131.g10

wl1n.pk0044.b1            EST       166.00 (gi 3135611)

Figure 1 presents an alignment of the amino acid sequences set forth in SEQ ID
NOs:2, 4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thaliana (SEQ ID NOs:23
(gi 2827139), 24 (gi 2827141), 26 (gi 4467125), 27 (gi 4886756) and 29 (gi 3135611)) and
Gossypium hirsutum (SEQ ID NOs:25 (gi 1706958) and 28 (gi 5081779)) sequences. The
data in Table 4 represent a calculation of the percent identity of the amino acid sequences
set forth in SEQ ID NOs:2, 4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thaliana
(SEQ ID NOs:23, 24, 26, 27 and 29) and Gossypium hirsutum (SEQ ID NOs:25 and 28)
sequences.


TABLE 4 

Percent Identity of Amino Acid Sequences Deduced From the Nucleotide Sequences 
of cDNA Clones Encoding Polypeptides Homologous 
to Arabidopsis thaliana and Gossypium hirsutum Cellulose Synthase 



SEQ ID NO.    Percent Identity to

 2            82% (gi 2827139)
 4            69% (gi 2827141)
 6            89% (gi 1706958)
 8            70% (gi 2827141)
10            70% (gi 2827141)
12            36% (gi 4467125)
14            86% (gi 4886756)
16            88% (gi 2827139)
18            86% (gi 2827141)
20            87% (gi 5081779)
22            70% (gi 3135611)



Sequence alignments and percent identity calculations were performed using the
Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc.,
Madison, WI). Multiple alignment of the sequences was performed using the Clustal
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default
parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for
pairwise alignments using the Clustal method were KTUPLE=1, GAP PENALTY=3,
WINDOW=5 and DIAGONALS SAVED=5. Sequence alignments and BLAST scores and
probabilities indicate that the nucleic acid fragments comprising the instant cDNA clones
encode a substantial portion of a cellulose synthase. These sequences represent the first
barley, corn, rice, soybean and wheat sequences encoding cellulose synthase.
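
For orientation, a percent identity value of the kind reported in Table 4 can be computed from a pairwise alignment as sketched below. This is not the Megalign implementation, whose exact formula is not given here; it assumes that identity is counted over alignment columns in which neither sequence has a gap, and the aligned fragments are invented.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of ungapped alignment columns in which the two residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # columns containing a gap are excluded (an assumed convention)
        compared += 1
        if x == y:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Invented aligned fragments, for illustration only.
identity = percent_identity("MKT-LLV", "MKSALLV")
print(round(identity, 1))
```

Other conventions divide by the length of the shorter sequence or by the full alignment length, so values from different programs are not always directly comparable.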

EXAMPLE 4 
Expression of Chimeric Genes in Monocot Cells 
A chimeric gene comprising a cDNA encoding the instant polypeptides in sense
orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA
fragment, and the 10 kD zein 3' end that is located 3' to the cDNA fragment, can be
constructed. The cDNA fragment of this gene may be generated by polymerase chain
reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites
(NcoI or SmaI) can be incorporated into the oligonucleotides to provide proper orientation
of the DNA fragment when inserted into the digested vector pML103 as described below.
Amplification is then performed in a standard PCR. The amplified DNA is then digested
with restriction enzymes NcoI and SmaI and fractionated on an agarose gel. The appropriate
band can be isolated from the gel and combined with a 4.9 kb NcoI-SmaI fragment of the
plasmid pML103. Plasmid pML103 has been deposited under the terms of the Budapest
Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., Manassas,
VA 20110-2209), and bears accession number ATCC 97366. The DNA segment from
pML103 contains a 1.05 kb SalI-NcoI promoter fragment of the maize 27 kD zein gene and
a 0.96 kb SmaI-SalI fragment from the 3' end of the maize 10 kD zein gene in the vector
pGem9Zf(+) (Promega). Vector and insert DNA can be ligated at 15°C overnight,
essentially as described (Maniatis). The ligated DNA may then be used to transform E. coli
XL1-Blue (Epicurian Coli XL-1 Blue™; Stratagene). Bacterial transformants can be
screened by restriction enzyme digestion of plasmid DNA and limited nucleotide sequence
analysis using the dideoxy chain termination method (Sequenase™ DNA Sequencing Kit;
U.S. Biochemical). The resulting plasmid construct would comprise a chimeric gene
encoding, in the 5' to 3' direction, the maize 27 kD zein promoter, a cDNA fragment
encoding the instant polypeptides, and the 10 kD zein 3' region.
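
The incorporation of cloning sites into the amplification primers can be sketched as follows. The recognition sequences CCATGG (NcoI) and CCCGGG (SmaI) are standard; everything else is an assumption made for illustration, including the function name, the 18-nucleotide annealing length, the two-base 5' clamp and the toy coding sequence. Because the NcoI site spans the ATG and fixes the next base as G, the sketch reports whether the first base of the second codon would be altered.

```python
NCOI = "CCATGG"  # NcoI recognition site; contains an ATG start codon
SMAI = "CCCGGG"  # SmaI recognition site
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def design_cloning_primers(cds: str, anneal_len: int = 18, clamp: str = "GC"):
    """Return (forward, reverse, second_codon_changed) for an NcoI/SmaI cloning strategy.

    anneal_len and clamp are hypothetical defaults, not values from the text.
    """
    cds = cds.upper()
    if not cds.startswith("ATG"):
        raise ValueError("coding sequence is expected to begin with ATG")
    if len(cds) < anneal_len + 4:
        raise ValueError("coding sequence is too short for the requested primers")
    # The site supplies ATG plus a G, so the primer resumes at the fifth base.
    second_codon_changed = cds[3] != "G"
    forward = clamp + NCOI + cds[4:4 + anneal_len]
    reverse = clamp + SMAI + reverse_complement(cds[-anneal_len:])
    return forward, reverse, second_codon_changed


# Toy coding sequence, invented for illustration only.
toy_cds = "ATGGCTGCTAATGCTGGTCTTGTTGCTGGTTCTAGAGATGAATAA"
fwd, rev, changed = design_cloning_primers(toy_cds)
print(fwd)
print(rev)
print(changed)
```

Placing different sites at the two ends is what fixes the orientation of the insert, since the NcoI end can only join the NcoI end of the digested vector.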

The chimeric gene described above can then be introduced into corn cells by the
following procedure. Immature corn embryos can be dissected from developing caryopses
derived from crosses of the inbred corn lines H99 and LH132. The embryos are isolated 10
to 11 days after pollination when they are 1.0 to 1.5 mm long. The embryos are then placed
with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et al.
(1975) Sci. Sin. Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable
embryogenic callus consisting of undifferentiated masses of cells with somatic
proembryoids and embryoids borne on suspensor structures proliferates from the scutellum
of these immature embryos. The embryogenic callus isolated from the primary explant can
be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks.

The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst AG, Frankfurt,
Germany) may be used in transformation experiments in order to provide for a selectable
marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236)
which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers
resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat
gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus
(Odell et al. (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene
from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens.

The particle bombardment method (Klein et al. (1987) Nature 327:70-73) may be used
to transfer genes to the callus culture cells. According to this method, gold particles (1 µm
in diameter) are coated with DNA using the following technique. Ten µg of plasmid DNAs
are added to 50 µL of a suspension of gold particles (60 mg per mL). Calcium chloride
(50 µL of a 2.5 M solution) and spermidine free base (20 µL of a 1.0 M solution) are added
to the particles. The suspension is vortexed during the addition of these solutions. After
10 minutes, the tubes are briefly centrifuged (5 sec at 15,000 rpm) and the supernatant
removed. The particles are resuspended in 200 µL of absolute ethanol, centrifuged again
and the supernatant removed. The ethanol rinse is performed again and the particles
resuspended in a final volume of 30 µL of ethanol. An aliquot (5 µL) of the DNA-coated
gold particles can be placed in the center of a Kapton™ flying disc (Bio-Rad Labs). The
particles are then accelerated into the corn tissue with a Biolistic™ PDS-1000/He (Bio-Rad
Instruments, Hercules CA), using a helium pressure of 1000 psi, a gap distance of 0.5 cm
and a flying distance of 1.0 cm.
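
The quantities delivered per bombardment follow from the volumes given above. The sketch below works through that arithmetic under the simplifying assumption that no gold or DNA is lost during the precipitation and ethanol rinses, so the results are upper bounds; the function name per_shot_load is hypothetical.

```python
def per_shot_load(gold_mg_per_ml: float, gold_volume_ul: float, dna_ug: float,
                  final_volume_ul: float, aliquot_ul: float):
    """Return (gold in mg, DNA in µg) carried by one aliquot of the final suspension."""
    total_gold_mg = gold_mg_per_ml * gold_volume_ul / 1000.0  # µL converted to mL
    fraction = aliquot_ul / final_volume_ul
    return total_gold_mg * fraction, dna_ug * fraction


# Values from the corn protocol above: 50 µL of gold at 60 mg per mL, 10 µg of DNA,
# a final volume of 30 µL and a 5 µL aliquot per flying disc.
gold_mg, dna_ug = per_shot_load(60.0, 50.0, 10.0, 30.0, 5.0)
print(round(gold_mg, 3), round(dna_ug, 3))
```

On these assumptions each 5 µL aliquot carries at most 0.5 mg of gold and about 1.7 µg of plasmid DNA.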

For bombardment, the embryogenic tissue is placed on filter paper over agarose-solidified
N6 medium. The tissue is arranged as a thin lawn and covers a circular area of
about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of
the PDS-1000/He approximately 8 cm from the stopping screen. The air in the chamber is
then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a
helium shock wave using a rupture membrane that bursts when the He pressure in the shock
tube reaches 1000 psi.

Seven days after bombardment the tissue can be transferred to N6 medium that
contains glufosinate (2 mg per liter) and lacks casein or proline. The tissue continues to
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to
fresh N6 medium containing glufosinate. After 6 weeks, areas of about 1 cm in diameter
of actively growing callus can be identified on some of the plates containing the
glufosinate-supplemented medium. These calli may continue to grow when sub-cultured on the
selective medium.

Plants can be regenerated from the transgenic callus by first transferring clusters of
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the
tissue can be transferred to regeneration medium (Fromm et al. (1990) Bio/Technology
8:833-839).

EXAMPLE 5 
Expression of Chimeric Genes in Dicot Cells 
A seed-specific expression cassette composed of the promoter and transcription 
terminator from the gene encoding the β subunit of the seed storage protein phaseolin from
the bean Phaseolus vulgaris (Doyle et al. (1986) J. Biol. Chem. 261:9228-9238) can be used
for expression of the instant polypeptides in transformed soybean. The phaseolin cassette 
includes about 500 nucleotides upstream (5') from the translation initiation codon and about 
1650 nucleotides downstream (3') from the translation stop codon of phaseolin. Between the 
5' and 3' regions are the unique restriction endonuclease sites Nco I (which includes the ATG 
translation initiation codon), Sma I, Kpn I and Xba I. The entire cassette is flanked by 
Hind III sites. 

The cDNA fragment of this gene may be generated by polymerase chain reaction
(PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be 
incorporated into the oligonucleotides to provide proper orientation of the DNA fragment 
when inserted into the expression vector. Amplification is then performed as described 
above, and the isolated fragment is inserted into a pUC18 vector carrying the seed 
expression cassette. 

Soybean embryos may then be transformed with the expression vector comprising
sequences encoding the instant polypeptides. To induce somatic embryos, cotyledons,
3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar
A2872, can be cultured in the light or dark at 26°C on an appropriate agar medium for
6-10 weeks. Somatic embryos which produce secondary embryos are then excised and
placed into a suitable liquid medium. After repeated selection for clusters of somatic
embryos which multiply as early, globular staged embryos, the suspensions are maintained
as described below.

Soybean embryogenic suspension cultures can be maintained in 35 mL liquid media on a
rotary shaker, 150 rpm, at 26°C with fluorescent lights on a 16:8 hour day/night schedule.
Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 
35 mL of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the method of 
particle gun bombardment (Klein et al. (1987) Nature (London) 327:70, U.S. Patent
No. 4,945,050). A DuPont Biolistic™ PDS 1000/HE instrument (helium retrofit) can be used 
for these transformations. 

A selectable marker gene which can be used to facilitate soybean transformation is a
chimeric gene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et al.
(1985) Nature 313:810-812), the hygromycin phosphotransferase gene from plasmid pJR225
(from E. coli; Gritz et al. (1983) Gene 25:179-188) and the 3' region of the nopaline synthase
gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed expression
cassette comprising the phaseolin 5' region, the fragment encoding the instant polypeptides
and the phaseolin 3' region can be isolated as a restriction fragment. This fragment can then
be inserted into a unique restriction site of the vector carrying the marker gene.

To 50 µL of a 60 mg/mL 1 µm gold particle suspension is added (in order): 5 µL
DNA (1 µg/µL), 20 µL spermidine (0.1 M), and 50 µL CaCl2 (2.5 M). The particle
preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the
supernatant removed. The DNA-coated particles are then washed once in 400 µL 70%
ethanol and resuspended in 40 µL of anhydrous ethanol. The DNA/particle suspension can
be sonicated three times for one second each. Five µL of the DNA-coated gold particles are
then loaded on each macro carrier disk.

Approximately 300-400 mg of a two-week-old suspension culture is placed in an
empty 60x15 mm petri dish and the residual liquid removed from the tissue with a pipette.
For each transformation experiment, approximately 5-10 plates of tissue are normally
bombarded. Membrane rupture pressure is set at 1100 psi and the chamber is evacuated to a
vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the
retaining screen and bombarded three times. Following bombardment, the tissue can be
divided in half and placed back into liquid and cultured as described above.

Five to seven days post bombardment, the liquid media may be exchanged with fresh 
media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL 
hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post 
bombardment, green, transformed tissue may be observed growing from untransformed, 
necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into 
individual flasks to generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line may be treated as an independent transformation event. These 
suspensions can then be subcultured and maintained as clusters of immature embryos or 
regenerated into whole plants by maturation and germination of individual somatic embryos. 

EXAMPLE 6 

Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant polypeptides can be inserted into the T7 E. coli 
expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. (1987) 
Gene 56:125-135) which employs the bacteriophage T7 RNA polymerase/T7 promoter 
system. Plasmid pBT430 was constructed by first destroying the EcoR I and Hind III sites in 
pET-3a at their original positions. An oligonucleotide adaptor containing EcoR I and 
Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM with 
additional unique cloning sites for insertion of genes into the expression vector. Then, the 
Nde I site at the position of translation initiation was converted to an Nco I site using 
oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region, 
5'-CATATGG, was converted to 5'-CCCATGG in pBT430. 
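
The effect of the mutagenesis can be checked directly on the two seven-base sequences quoted above. The sketch below uses the standard recognition sequences for Nde I (CATATG) and Nco I (CCATGG). The helper names are illustrative and are not taken from the source.

```python
NDE_I = "CATATG"   # Nde I recognition sequence
NCO_I = "CCATGG"   # Nco I recognition sequence


def sites_present(seq):
    """Report which of the two sites, and the ATG start codon, occur in seq."""
    seq = seq.upper()
    return {"NdeI": NDE_I in seq, "NcoI": NCO_I in seq, "ATG": "ATG" in seq}


def substitutions(a, b):
    """List (1-based position, old base, new base) for each mismatch."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


before = "CATATGG"   # pET-3aM, as quoted in the text
after = "CCCATGG"    # pBT430, as quoted in the text

print(sites_present(before))
print(sites_present(after))
print(substitutions(before, after))
```

Two base substitutions (positions 2 and 3) remove the Nde I site and create the Nco I site, while the ATG at the position of translation initiation is preserved.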

Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic 
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve 
GTG™ low melting agarose gel (FMC). Buffer and agarose contain 10 µg/mL ethidium 
bromide for visualization of the DNA fragment. The fragment can then be purified from the 
agarose gel by digestion with GELase™ (Epicentre Technologies) according to the 
manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 µL of water. 
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can be 
purified from the excess adapters using low melting agarose as described above. The vector 
pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized 
with phenol/chloroform as described above. The prepared vector pBT430 and fragment can 
then be ligated at 16°C for 15 hours followed by transformation into DH5 electrocompetent 
cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and 
100 µg/mL ampicillin. Transformants containing the gene encoding the instant polypeptides 
are then screened for the correct orientation with respect to the T7 promoter by restriction 
enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the correct 
orientation relative to the T7 promoter can be transformed into E. coli strain BL21(DE3) 
(Studier et al. (1986) J. Mol. Biol. 189:113-130). Cultures are grown in LB medium 
containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of approximately 
1, IPTG (isopropylthio-β-galactoside, the inducer) can be added to a final concentration of 
0.4 mM and incubation can be continued for 3 h at 25°C. Cells are then harvested by 
centrifugation and re-suspended in 50 µL of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM 
DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can 
be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the supernatant 
determined. One µg of protein from the soluble fraction of the culture can be separated by 
SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating 
at the expected molecular weight. 
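
The expected molecular weight of an expressed polypeptide can be roughly estimated from its length. The sketch below uses the common rule of thumb of about 110 Da per amino acid residue. That figure is an approximation introduced here for illustration, not a value from the source, and a calculation from the actual residue composition would be more accurate. The example length of 304 residues is the length stated for SEQ ID NO:2 in the Sequence Listing.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0   # rule-of-thumb average, an approximation


def approx_mass_kda(n_residues, avg_residue_da=AVERAGE_RESIDUE_MASS_DA):
    """Rough polypeptide mass in kDa from the number of residues."""
    if n_residues <= 0:
        raise ValueError("n_residues must be positive")
    return n_residues * avg_residue_da / 1000.0


# SEQ ID NO:2 is listed with a length of 304 amino acids.
print(round(approx_mass_kda(304), 1), "kDa (approximate)")
```

Any fusion partner or affinity tag adds its own mass to this estimate, so the band for a fusion protein would be expected at a correspondingly higher position.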

EXAMPLE 7 
Evaluating Compounds for Their Ability to Inhibit 
the Activity of Cellulose Synthase 
The polypeptides described herein may be produced using any number of methods 
known to those skilled in the art. Such methods include, but are not limited to, expression in 
bacteria as described in Example 6, or expression in eukaryotic cell culture, in planta, and 
using viral expression systems in suitably infected organisms or cell lines. The instant 
polypeptides may be expressed either as mature forms of the proteins as observed in vivo or 
as fusion proteins by covalent attachment to a variety of enzymes, proteins or affinity tags. 
Common fusion protein partners include glutathione S-transferase ("GST"), thioredoxin 
("Trx"), maltose binding protein, and C- and/or N-terminal hexahistidine polypeptide 
("(His)6"). The fusion proteins may be engineered with a protease recognition site at the 
fusion point so that fusion partners can be separated by protease digestion to yield intact 
mature enzyme. Examples of such proteases include thrombin, enterokinase and factor Xa. 
However, any protease can be used which specifically cleaves the peptide connecting the 
fusion protein and the enzyme. 

Purification of the instant polypeptides, if desired, may utilize any number of 
separation technologies familiar to those skilled in the art of protein purification. Examples 
of such methods include, but are not limited to, homogenization, filtration, centrifugation, 
heat denaturation, ammonium sulfate precipitation, desalting, pH precipitation, ion exchange 
chromatography, hydrophobic interaction chromatography and affinity chromatography, 
wherein the affinity ligand represents a substrate, substrate analog or inhibitor. When the 
instant polypeptides are expressed as fusion proteins, the purification protocol may include 
the use of an affinity resin which is specific for the fusion protein tag attached to the 
expressed enzyme or an affinity resin containing ligands which are specific for the enzyme. 
For example, the instant polypeptides may be expressed as a fusion protein coupled to the 
C-terminus of thioredoxin. In addition, a (His)6 peptide may be engineered into the 
N-terminus of the fused thioredoxin moiety to afford additional opportunities for affinity 
purification. Other suitable affinity resins could be synthesized by linking the appropriate 
ligands to any suitable resin such as Sepharose-4B. In an alternate embodiment, a 
thioredoxin fusion protein may be eluted using dithiothreitol; however, elution may be 
accomplished using other reagents which interact to displace the thioredoxin from the resin. 
These reagents include β-mercaptoethanol or other reduced thiol. The eluted fusion protein 
may be subjected to further purification by traditional means as stated above, if desired. 
Proteolytic cleavage of the thioredoxin fusion protein and the enzyme may be accomplished 
after the fusion protein is purified or while the protein is still bound to the ThioBond™ 
affinity resin or other resin. 

Crude, partially purified or purified enzyme, either alone or as a fusion protein, may be 
utilized in assays for the evaluation of compounds for their ability to inhibit enzymatic 
activation of the instant polypeptides disclosed herein. Assays may be conducted under well 
known experimental conditions which permit optimal enzymatic activity. For example, 
assays for cellulose synthase activity are presented in WO 98/18949 and WO 98/00549. 
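
Whatever assay is chosen, the comparison between treated and untreated enzyme reduces to a percent-inhibition calculation. The following is a minimal sketch of that comparison. The activity units, the compound names, the example readings and the 50% cut-off are all hypothetical values chosen for illustration, and none of them comes from the source.

```python
def percent_inhibition(treated_activity, untreated_activity):
    """Percent loss of activity relative to the untreated control."""
    if untreated_activity <= 0:
        raise ValueError("untreated activity must be positive")
    return 100.0 * (1.0 - treated_activity / untreated_activity)


def select_inhibitors(treated_by_compound, untreated_activity, cutoff=50.0):
    """Return compounds whose inhibition meets or exceeds the cut-off."""
    return sorted(
        name
        for name, activity in treated_by_compound.items()
        if percent_inhibition(activity, untreated_activity) >= cutoff
    )


# Hypothetical readings in arbitrary activity units.
untreated = 100.0
treated = {"compound_A": 25.0, "compound_B": 95.0}

for name, activity in treated.items():
    print(name, round(percent_inhibition(activity, untreated), 1))
print(select_inhibitors(treated, untreated))
```

In practice the untreated control would be run in parallel under the same conditions as the treated samples, so that the ratio reflects only the effect of the compound.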
CLAIMS 

What is claimed is: 

1. An isolated nucleic acid fragment comprising at least 900 nucleotides, wherein 
the nucleic acid fragment encodes a cellulose synthase comprising a member selected from 
the group consisting of: 

(a) an isolated nucleic acid fragment encoding an amino acid sequence that 
is at least 90% identical to the amino acid sequence set forth in a 
member selected from the group consisting of SEQ ID NO:2, 6, 12, 14, 
16, 18, 20 and 22; 

(b) an isolated nucleic acid fragment that is complementary to (a). 
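
A percent-identity threshold such as the one recited above can be illustrated with a simple column-by-column comparison of two aligned sequences. The sketch below is an assumption-laden illustration only: it skips any column containing a gap and divides matches by the remaining columns, whereas the alignment method and parameters defined in the specification take precedence. The two example strings are the 541-600 window of the Figure 1 rows for SEQ ID NO:4 and SEQ ID NO:14, so the result describes that 39-residue window, not the full-length sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns where neither row has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue          # ignore gapped columns (an assumption)
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Figure 1, positions 541-600, rows for SEQ ID NO:4 and SEQ ID NO:14.
window_seq4 = "KIDYLKDKVAPNFVRERRAMKREYEEFKVRINALVAKAQ"
window_seq14 = "KIDYLKDKVQPTFVKERRAMKREYEEFKVRINALVAKAQ"

print(round(percent_identity(window_seq4, window_seq14), 1))
```

The two windows differ at three of 39 positions, so this fragment alone would clear a 90% threshold. A full-length comparison could give a different figure.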

2. The isolated nucleic acid fragment of Claim 1 wherein the nucleic acid fragment is 
a functional RNA. 

3. The isolated nucleic acid fragment of Claim 1 wherein the nucleotide sequence 
of the fragment comprises the sequence set forth in a member selected from the group 
consisting of SEQ ID NO:1, 5, 11, 13, 15, 17, 19 and 21. 

4. A chimeric gene comprising the nucleic acid fragment of Claim 1 operably 
linked to suitable regulatory sequences. 

5. A transformed host cell comprising the chimeric gene of Claim 4. 

6. A cellulose synthase polypeptide comprising all or a substantial portion of the 
amino acid sequence set forth in a member selected from the group consisting of SEQ ID 
NO:2, 6, 12, 14, 16, 18, 20 and 22. 

7. An isolated nucleic acid fragment encoding a cellulose synthase comprising a 
member selected from the group consisting of: 

(a) an isolated nucleic acid fragment encoding an amino acid sequence that 
is a functionally active polypeptide and at least 80% identical to the 
amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:4, 8 and 10; 

(b) an isolated nucleic acid fragment that is complementary to (a). 

8. The isolated nucleic acid fragment of Claim 7 wherein the nucleic acid fragment is 
a functional RNA. 

9. The isolated nucleic acid fragment of Claim 7 wherein the nucleotide sequence 
of the fragment comprises the sequence set forth in a member selected from the group 
consisting of SEQ ID NO:3, 7 and 9. 

10. A chimeric gene comprising the nucleic acid fragment of Claim 7 operably 
linked to suitable regulatory sequences. 

11. A transformed host cell comprising the chimeric gene of Claim 10. 

12. A cellulose synthase polypeptide comprising all or a substantial portion of the 
amino acid sequence set forth in a member selected from the group consisting of SEQ ID 
NO:4, 8 and 10. 

13. A method of altering the level of expression of a cellulose synthase in a host 
cell comprising: 

(a) transforming a host cell with the chimeric gene of any of Claims 4 and 
10; and 

(b) growing the transformed host cell produced in step (a) under conditions 
that are suitable for expression of the chimeric gene 

wherein expression of the chimeric gene results in production of altered levels of a cellulose 
synthase in the transformed host cell. 

14. A method of obtaining a nucleic acid fragment encoding all or a substantial 
portion of the amino acid sequence encoding a cellulose synthase comprising: 

(a) probing a cDNA or genomic library with the nucleic acid fragment of 
any of Claims 1 and 7; 

(b) identifying a DNA clone that hybridizes with the nucleic acid fragment 
of any of Claims 1 and 7; 

(c) isolating the DNA clone identified in step (b); and 

(d) sequencing the cDNA or genomic fragment that comprises the clone 
isolated in step (c) 

wherein the sequenced nucleic acid fragment encodes all or a substantial portion of the 
amino acid sequence encoding a cellulose synthase. 

15. A method of obtaining a nucleic acid fragment encoding a substantial portion 
of an amino acid sequence encoding a cellulose synthase comprising: 

(a) synthesizing an oligonucleotide primer corresponding to a portion of 
the sequence set forth in any of SEQ ID NOs:1, 3, 5, 7, 9, 11, 13, 15, 
17, 19 and 21; and 

(b) amplifying a cDNA insert present in a cloning vector using the 
oligonucleotide primer of step (a) and a primer representing sequences 
of the cloning vector 

wherein the amplified nucleic acid fragment encodes a substantial portion of an amino acid 
sequence encoding a cellulose synthase. 

16. The product of the method of Claim 14. 

17. The product of the method of Claim 15. 

18. A method for evaluating at least one compound for its ability to inhibit the 
activity of a cellulose synthase, the method comprising the steps of: 

(a) transforming a host cell with a chimeric gene comprising a nucleic acid 
fragment encoding a cellulose synthase, operably linked to suitable 
regulatory sequences; 

(b) growing the transformed host cell under conditions that are suitable for 
expression of the chimeric gene wherein expression of the chimeric 
gene results in production of the cellulose synthase encoded by the 
operably linked nucleic acid fragment in the transformed host cell; 

(c) optionally purifying the cellulose synthase expressed by the 
transformed host cell; 

(d) treating the cellulose synthase with a compound to be tested; and 

(e) comparing the activity of the cellulose synthase that has been treated 
with a test compound to the activity of an untreated cellulose synthase, 

thereby selecting compounds with potential for inhibitory activity. 

Figure 1 (cont'd.) 

YGSVAWKDRMEVWKRRQGEKLQVIKHEGGNNGRGSN-DDDELDDPDMPMM-DEGRQPLSR 

YGNAYWP QDEMYGD DMDEGMRGGMVETADKPWRPLSR 

WRERMDDWKLQHG NLGPEPDDDPEMGLI -DEARQPLSR 

YGSVAWKDRMEEWKRKQNEKLQVVRHEGDP DFEDGDDADFPMM-DEGRQPLSM 

361 420 

KIPLPSSQINPYRMIIIIRLVVLCFFFHYRVMHPVPDAFALWLISVICEIWFAMSWILDQ 

KVPISSSRINPYRMIIVIRLVVLGFFFHYRVMHPAKDAFALWLISVICEIWFAMSWILDQ 
KIPLPSSQINPYRMIIIIRLVVLGFFFHYRVMHPVNDAFALWLISVICEIWFAMSWILDQ 

KVPIASSKINPYRMVIVARLVILAFFLRYRLMNPVHDALGLWLTSIICEIWFAFSWILDQ 

LHPVNDAYGLWLTSVICEIWFAVSWIMDQ 

KVPISSSRINPYRMIIVIRLVVLGFFFHYRVMHPAKDAFALWLISVICEIWFAMSCILDQ 
VVPIPSSRLTFYRVVIILRLIILCFFLQYRTTHPVKNAYPLWLTSVICEIWFAFSWLLDQ 
KLPIRSSRINPYRMLILCRLAILGLFFHYRILHPVNDAYGLWLTSVICEIWFAVSWILDQ 

RIPIPAAIISPYRLLIVIRFWLCFFLTWRIRNPNEDAIWLWLMSIICELWFGFSWILDQ 
KVPIASSKINPYRMVIVARLVILAVFLRYRLLNPVHDALGLWLTSVICEIWFAVSWILDQ 

KIPIKSSKINPYRMLIVLRLVILGLFFHYRILHPVKDAYALWLISVICEIWFAVSWVLDQ 

421 480 

FPKWFPIERETYLDRLSLRFDKEGHPS QLAPVDFFVSTVDPLKEPPLVTANTVLS 

FPKWLPIERETYLDRLSLRFDKEGQPS QLAPI DFFVSTVDPTKEPPLVTANTVLS 

FPKWFPIERETYLDRLSLRFDKEGQPS QLAPI DFFVSTVDPLKEPPLVTTNTVLS 

FPKWFPIDRETYLDRLSIRYEREGEPN MLAPVDVFVSTVDPMKEPPLVTANTVLS 

FPKWYPIQRETYLDRLSLRYEKEGKPS ELS S VDVFVS T VD PMKE P PL I T ANTVLS 

FPKWFPIERETYLDRLSLRFDKEGQPS QLAPI DFFVSTVDPTKEPPLVTANTVLS 

FPKWYPINRETYLDRLAIRYDRDGEPS QLV P VDVFVS TVDPLKE P PLVT ANTVLS 

FPKWYPIERETYLDRLSLRYEKEGKPS GLAPVDVFVSTVDPLKEPPLITANTVLS 

IPKLCPINRSTDLEVLRDKFDMPSPSNPTGRSDLPGIDLFVSTADPEKEPPLVTANTILS 
FPKWFPIERETYLDRLSLRYEREGEPN MLAPVDVFVSTVDPLKEPPLVTSNTVLS 

FPKWYPIERETYLDRLSLRYEKEGKPS GLSPVDVFVSTVDPLKEPPLITANTVLS 

481 540 

ILSVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKRYSLEPRAPEWYFQQ— 
H 

ILSVDYPVEKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFSKKFNIEPRAPEWYFQQ— 
ILSVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKRYNIEPRAPEWYFQQ— 
ILAAGYPAGKVTCYISDDAGAEVTRNAVVEAARFAALWVSFCRKHGVEPRNLEAYFNAGE 
ILAMDYPVDKISCYISDDGASMCTFESLSETAEFARKWVPFCKKFSIEPRAPEMYFSE— 

ILAVDYPVDKVACYVSDDGAAMLTFEALSETSEFARRWVPFCKKYNIEPRAPEWYFGQ-- 

SEQ ID NO: 20 

SEQ ID NO: 22 ILSVDYPVEKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFSKKFNIEPRAPEWYFQQ — 

SEQ ID NO: 23 ILSVDYPVDKVACYVSDDGSAMLTFESLSETAEFAKKWVPFCKKFNIEPRAPEFYFAQ-- 

SEQ ID NO: 24 ILAVDYPVDKVACYVSDDGAAMLTFEALSDTAEFARKWVPFCKKFNIEPRAPEWYFSQ-- 

SEQ ID NO: 25 RRWVPFCKKHNVEPRAPEFYFNE — 

SEQ ID NO: 26 ILAVDYPVEKVSCYLSDDGGALLSFEAMAEAASFADLWVPFCRKHNIEPRNPDSYFSL— 

SEQ ID NO: 27 ILAMDYPVEKISCYVSDDGASMLTFESLSETAEFARKWVPFCKKFSIEPRAPEMYFTL— 

SEQ ID NO: 28 DYPVEKVSCYVS DDGAAMLT FEALS ETSEFARKWVPFCKKYN IE PRAPEWYFAQ — 

SEQ ID NO: 29 I LAVDYPVDKVACYVS DDGAAMLT FEALS ETAEFARKWVPFCKKYC IE PRAPEWYFCH — 

541 600 

SEQ ID NO: 2 

SEQ ID NO: 4 KIDYLKDKVAPNFVRERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 6 

SEQ ID NO: 8 KI DYLKDKVAAS FVRERRAMKREYEE FKVRI NALVAKAQ 

SEQ ID NO: 10 KIDYLKDKVAANFVRERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 12 GGGGKAKVVARGSY-RGMAWPELVRDRRRVRREYEEMRLRIDALQAADARRR 

SEQ ID NO: 14 KIDYLKDKVQPTFVKERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 16 AKAQ 

SEQ ID NO: 18 KMDYLKNKVHPAFVRERRAMKRDYEEFKVRINSLVATAQ 

SEQ ID NO: 20 

SEQ ID NO: 22 KI DYLKDKVAAS FVRERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 23 KIDYLKDKIQPS FVKE RRAMKR E Y E E FKVR I NALVAKAQ 

SEQ ID NO: 24 KMDYLKNKVHPAFVRERRAMKRDYEEFKVKINALVATAQ 

SEQ ID NO: 25 KIDYLKDKVHPSFVKERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 26 KIDPTKNKSRIDFVKDRRKIKREYDEFKVRINGLPDSIRRRSDAFNAREE 

SEQ ID NO: 27 KVDYLQDKVHPTFVKERRAMKREYEEFKVRINAQVAKAS 

SEQ ID NO: 28 KI DYLKDKVQTS FVKERRAMKREYEEFKVRVNGLVAKAQ 

SEQ ID NO: 29 KMDYLKNKVHPAFVRERRAMKRDYEEFKVKINALVATAQ 



601 660 
SEQ ID NO: 2 

SEQ ID NO: 4 KVPEEGWTMQDGTPWPG NNVRDKPGMIQVFL 

SEQ ID NO: 6 

SEQ ID NO: 8 KVPEEGWTMQDGSPWPG NNVRDHPGMIQVFL 

SEQ ID NO: 10 KVPEEGWTMQDGTPWPG NNVRDHPGMIQVFL 

SEQ ID NO: 12 RRGAADDHAGVVQVLIDFA 

SEQ ID NO: 14 KVPQGGWIMQDGTPWPG NNTKDHPGMIQVFL 

SEQ ID NO: 16 KMPEEGWTMQDGTPWPG NNPRDHPGMIQVFL 

SEQ ID NO: 18 KVPEDGWTMQDGTPWPG NNVRDHPGMIQVFL 

SEQ ID NO: 20 

SEQ ID NO: 22 KVPEEGWTMQDGSPWPG 

SEQ ID NO: 23 KIPEEGWTMQDGTPWPG NNTRDHPGMIQVFL 

SEQ ID NO: 24 KVPEEGWTMQDGTPWPG NNVRDHPGMIQVFL 

SEQ ID NO: 25 KKPEEGWVMQDGTPWPG NNTRDHPGMIQVYL 

SEQ ID NO: 26 MKALKQMRESGGDPTEPVKVPKATW-MADGTHWPGTWAASTREHSKGDHAGILQVMLKPP 

SEQ ID NO: 27 KVPLEGWIMQDGTPWPG NNTKDHPGMIQVFL 

SEQ ID NO: 28 KVPEEGWIMQDGT PWPG NNTRDHPGMIQVFL 

SEQ ID NO: 29 KVPEDGWTMQDGTPWPG NSVRDHPGMIQVFL 

661 720 
SEQ ID NO:2 

SEQ ID NO: 4 G-QSGGHDVE GNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLTNA 

SEQ ID NO: 6 

SEQ ID NO: 8 G-QSGGRDVE GNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLSNA 

SEQ ID NO: 10 G-QSGGLDCE GNELPRLVYVS REKRPGYNHHKKAGAMNALVRVSAVLTNA 

SEQ ID NO: 12 GSVPQLGVANGSKLIDVASVDVCLPALVYVCREKRRGHAHHRKAGAMNA 
SEQ ID NO: 14 G-SSGGLDTE GNQLPRLVYVSREKRPGFQHHKKAGAMNALVRVSAVLTNA 
SEQ ID NO: 16 G-HSGGLDTD GNELPRLVYVSREKRPGFQHHKKAGAMNALIRVSAVLTNG 
SEQ ID NO: 18 G-QDGVRDVE GNEL PRLVYVS REKRPG FDHHKKAGAMNAL VRASAI I T NA 
SEQ ID NO: 20 
SEQ ID NO: 22 
SEQ ID NO: 23 G-HSGGLDTD GNELPRLI YVSREKRPGFQHHKKAGAMNALIRVSAVLTNG 
SEQ ID NO: 24 G-HSGVRDTD GNELPRLVYVS REKRPG FDHHKKAGAMNSLIRVSAVLSNA 
SEQ ID NO: 25 G- S AGAL DVD GKEL PRLVYVS REKRPG YQHHKKAGAENALVRVS AVLTNA 
SEQ ID NO: 26 SSDPLIG-NSDDKVIDFSDTDTRLPMFVYVS REKRPG YDHNKKAGAMNALVRASAILSNG 
SEQ ID NO: 27 G-HSGGFDVE GHEL PRLVYVS REKRPG FQHHKRAGAMNALVRVAG VLTNA 
SEQ ID NO: 28 G-QSGGLDAE GNELPRLVYVS REKRPG FQHHKKAGAMNALVRVSAVLTNG 
SEQ ID NO: 29 G-SDGVRDVE NNELPRLVYVSREKRPGFDHHKKAGAMNSLIRVSGVLSNA 

721 780 
SEQ ID NO: 2 
SEQ ID NO: 4 PYLLNLDCDHYINNSKAIKEAMCFMMDPLLGKK VCYVQFPQRFDGI DRHDRYAN 
SEQ ID NO: 6 
SEQ ID NO: 8 AYLLNLDCDHYINNSKAIKEAMCFMMDPLVGKK VCYVQFPQRFDGI DKNDRYAN 
SEQ ID NO: 10 PYLLNLDCDHYINNSKAIKEAMCFMMDPLLGKK VCYVQFPQRFDGI DRHDRYAN 
SEQ ID NO: 12 PFILDLDCDYYVNNSQALRAGICFMIERGGGGAAEDAGAVAFVQFPQRVDGVDPGDRYAN 
SEQ ID NO: 14 PFMLNLDCDHYVNNSKAAREAMCFLMDPQTGKK VCYVQFPQRFDGI DTHDRYAN 
SEQ ID NO: 16 AYLLNVDCDH YFNNS KALKEAMC FMMDPVLGKK TCYVQ FPQRFDG I DLHDRYAN 
SEQ ID NO: 18 PYLLNVDCDHYINNSKALREAMCFMMDPQLGKK VCYVQ FPQRFDG I DRHDRYSN 
SEQ ID NO: 20 EAMCFLMDPNLGPQ VCYVQ FPQRFDGIDRNDRYAN 
SEQ ID NO: 22 
SEQ ID NO: 23 AYLLNVDCDHYFNNSKAIKEAMCFMMDPAIGKK CCYVQFPQRFDGI DLHDRYAN 
SEQ ID NO: 24 PYLLNVDCDHYINNSKAIRESMCFMMDPQSGKK VCYVQFPQRFDGI DRHDRYSN 
SEQ ID NO: 25 PFILNLDCDHYINNSKAMREAMCFLMDPQFGKK LCYVQ FPQRFDG I DRHDRYAN 
SEQ ID NO: 26 PFILNLDCDHYIYNCKAVREGMCFMMDRG-GED ICYIQFPQRFEGIDPSDRYAN 
SEQ ID NO: 27 PFMLNLDCDHYVNNSKAVREAMCFLMDPQIGKK VCYVQFPQRFDGI DTNDRYAN 
SEQ ID NO: 28 AFLLNLDCDHYINNSKALREAMCFLMDPNLGKQ VCYVQFPQRFDGI DRNDRYAN 
SEQ ID NO: 29 PYLLNVDCDHYINNSKALREAMCFMMDPQSGKK ICYVQFPQRFDGI DRHDRYSN 

781 840 
SEQ ID NO: 2 
SEQ ID NO: 4 RNVVFFDINMKGLDGIQGPI YVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCI 
SEQ ID NO: 6 
SEQ ID NO: 8 RNVVFFDINMKGLDGIQGPI YVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCL 
SEQ ID NO: 10 RNVVFFDINMKGLDGIQGPI YVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCF 
SEQ ID NO: 12 HNRVLFDCTELGLDGLQGPIYVGTGCLFRRVALYSVDLPR 
SEQ ID NO: 14 RNTVFFDINMKGLDGIQGPVYVGTGCVFRRQALYGYNPPKGPK--RPKMVSCDC 
SEQ ID NO: 16 RNIVFFDINMKGQDGVQGPVYVGTGCCFNRQALYGYDPVLTEEDLE PNIIV 
SEQ ID NO: 18 RNVVFFDI NMKGLDG I QG P I YVG TGC VFRRYALYG Y DAP AKKKPPSKTCNCWPKWCC 
SEQ ID NO: 20 RNTVFFDINLRGLDGIQGPVYVGTGCVFNRTAI YGYEPPIKAK--K PGFLA 
SEQ ID NO: 22 
SEQ ID NO: 23 RNIVFFDINMKGLDGIQGPVYVGTGCCFNRQALYGYDPVLTEEDLE PNIIV 
SEQ ID NO: 24 RNVVFFDINMKGLDGIQGPI YVGTGCVFRRQALYGFDAP KKKKPPGKTCNCWPKWCC 
SEQ ID NO: 25 RNWFFDINMLGLDGLQGPVYVGTGCVFNRQALYGYDPPVSEKRPK MTCDCWPSWCC 
SEQ ID NO: 26 NNTVFFDGNMRALDGVQGPVYVGTGTMFRRFALYGFDPP 
SEQ ID NO: 27 RNTVFFDINMKGLDGIQGPVYVGTGCVFKRQALYGYEPPKGPK--RPKMISCGC 
SEQ ID NO: 28 RNTVFFDINLRGLDGIQGPVYVGTGCVFNRTALYGYEPPLKPKHRK TGILS 
SEQ ID NO: 29 RNVVFFDI NMKGL DGLQG P I YVGTGC VFRRQAL YG FDAP KKKKG PRKTCNCWPKWCL 

841 900 
SEQ ID NO: 2 
SEQ ID NO: 4 CCCCFGNRKTKKKTKTSKP KFEKI KKL F- KKKENQAPAY ALGE I DE A- -APG- 
SEQ ID NO: 6 
SEQ ID NO: 8 SCCC SRNKNKKKTTKP KTEKKKRLFFK KAEN PSPAYALGEIDEG- -APG- 
SEQ ID NO: 10 CCCCFGNRKQKK TTKP KTEKKKLLFFKKEENQS PAYALGE I DEA- -APG- 
SEQ ID NO: 12 

SEQ ID NO: 14 -CPCFGSRKKYKE KNDANGEAASLKG 

SEQ ID NO: 16 -KSCCGSRKKGKGGNKK YS — DKKKAMGR — TESTVPI FNMEDIEEGVEG — Y 

SEQ ID NO: 18 LCC— GSRKKKN ANS KKEKKRKV — KHSEASKQIHALENIEAGN — EG- 

SEQ ID NO: 20 -SLCXG-KKKASKSKKR SSDKKKSNKH-- -VDSSVPVFNLEDIEEGVEGAGF 

SEQ ID NO:22 

SEQ ID NO: 23 -KSCCGSRKKGKSS-KK YNYEKRRGINR — SDSNAPLFNMEDIDEGFEG — Y 

SEQ ID NO: 24 LCC — GLRKKSK T KAKDKKT — NTKETSKQIHALENVDEGVIVPV- 

SEQ ID NO: 25 -CCCGGSRKKSKKKGEKKGLLGGLLYGKKKKMMGKNYVKKGSAPVFDLEEIEEGLEG— Y 

SEQ ID NO: 26 NPDKLLEKKESETEALTTSDFDPDLDVTQLPKRFGNSTLL AESIPI 

SEQ ID NO: 27 -CPCFGRRRKNKK FSKNDMNGDVAALGG 

SEQ ID NO: 28 -SLCGGSRKKSSKSSKK GSDKKKSGKH — VDSTVPVFNLEDIEEGVEGAGF 

SEQ ID NO: 29 LCF— GSRKNRK AKT VAADKKK KNREASKQIHALENIEEGRGHKV- 

901 960 
SEQ ID NO: 2 

SEQ ID NO: 4 AENEKASIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 

SEQ ID NO: 6 ET 

SEQ ID NO: 8 ADIEKAGIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 

SEQ ID NO: 10 AENEKAGIVNQQKLEKKFGQSSVFVTSTLLENGGTLKSAS PASLLKEAIHVISCGYEDKT 

SEQ ID NO: 12 

SEQ ID NO: 14 MDDDKEVLMSQMNFEKKFGQSSIFVTSTLMEEGGVPPSSSPAALLKEAIHVISCGYEDKT 

SEQ ID NO: 16 DD-ERTLLMSQKSLEKRFGQSPVFIAATFMEQGGIPPSTNPATLLKEAIHVISCGYEDKT 

SEQ ID NO: 18 TNNEKTSNLTQTKLEKRFGQSPVFVASTLLDDGGVPHGVSPASLLKEAIQVISCGYEDKT 

SEQ ID NO: 20 DD-EKSVLMSQMSLEKRFGQSAAFVASTLMEYGGVPQSSTPESLLKEAIHVISCGYEDKS 

SEQ ID NO:22 

SEQ ID NO: 23 DD-ERSILMSQRSVEKRFGQSPVFIAATFMEQGGIPPTTNPATLLKEAIHVISCGYEDKT 

SEQ ID NO: 24 SNVEKRSEATQLKLEKKFGQSPVFVASAVLQNGGVPRNASPACLLREAIQVISCGYEDKT 

SEQ ID NO: 25 EELEKSTLMSQKNFEKRFGQSPVFIASTLMENGGLPEGTNSTSLIKEAIHVISCGYEEKT 

SEQ ID NO: 26 AEFQGRPLADHPAV — KYGRPP GALR VPRDPLDATTVAESVSVI SCWYEDKT 

SEQ ID NO: 27 AEGDKEHLMFEMNFEKTFGQSSIFVTSTLMEEGGVPPSSSPAVLLKEAIHVISCGYEDKT 

SEQ ID NO: 28 DD-EKSLLMSQMSLEKRFGQSAVFVASTLMENGGVPQSATPETLLKEAIHVISCGYEDKT 

SEQ ID NO: 29 LNVEQSTEAMQMKLQKKYGQSPVFVASARLENGGMARNASPACLLKEAIQVISRGYEDKT 

961 1020 

SEQ ID NO: 2 HEDILTGFKMHARGWISIYCMPPRPCFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 4 GWGKDIGWIYGSVTEDILTGFKMHCHGWRSIYCIPKRAAFKGSAPLNLSDRLHQVLRWAL 

SEQ ID NO: 6 EWGKE I GWI YGS VTEDI LTG FKMHCRGWKSVYCTPTRPAFKGSAPINLS DRLHQVLRWAL 

SEQ ID NO: 8 DWGKEIGWI YGS ITEDILTGFKMHCHGWRS I YCIPKRPAFKGSAPLNLS DRLHQVLRWAL 

SEQ ID NO: 10 DWGKE I GWI YGS VTE D I LTG FKMHCHGWRS I YC I PKRVAFKGS APLNLS DRLHQVLRWAL 

SEQ ID NO: 12 WRP RRSL 

SEQ ID NO: 14 EWGLELGWI YGS ITEDI LTGFKMHCRGWRS I YCMPKRAAFKGTAP INLS DRLNQVLRWAL 

SEQ ID NO: 16 EWGKE I GWI YGSVTEDILTGFKMHARGWIS I YCMPPRPAFKGSAPI NLS DRLNQVLRWAL 

SEQ ID NO: 18 EWGKEVGWI YGS VTEDI LTG FKMHCHGWRS VYCIPKRPAFKGSAPINLS DRLHQVLRWAL 

SEQ ID NO: 20 EWGTEIGWI YGS VTEDI LTGFKMHARGWRS I YCMPKRPAFKGSAPINLS DRLNQVLRWAL 

SEQ ID NO: 22 

SEQ ID NO: 23 EWGKEIGWI YGSVTEDILTGFKMHARGWISI YCNPPRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 24 EWGKE I GWI YGS VTE D I LTG FKMHCHGWRS VYCMPKRAAFKGS API NLS DRLHQVLRWAL 

SEQ ID NO: 25 EWGKEIGWI YGS VTEDI LTGFKMHCRGWKSVYCVPKRPAFKGSAPI NLS DRLHQVLRWAL 

SEQ ID NO: 26 EWGDRVGWI YGSVTEDVVTGYRMHNRGWRSVYCITKRDSFRGSAPINLTDRLHQVLRWAT 

SEQ ID NO: 27 EWGTELGWIYGSITEDILTGFKMHCRGWRSI YCMPKRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 28 DWGSEIGWIYGSVTEDILTGFKMHARGWRSI YCMPKRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 29 EWGKEIGWIYGSVTEDILTGSKMHSHGWRHVYCTPKLAAFKGSAPINLSDRLHQVLRWAL 

1021 1080 
SEQ ID NO: 2 GSVEILFSRHCPIWYNYGG-RLKLLERMAYINTIVYPITSLPLIAYCVLPAICLLTNKFI 
SEQ ID NO: 4 GSIEIFFSNHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 
SEQ ID NO: 6 GSVEIFMSRHCPLWYAYGG-RLKWLERFAYTNTIVYPFTSIPLLAYCTIPAVCLLTGKFI 
SEQ ID NO: 8 GSVEIFFSKHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 
SEQ ID NO: 10 GSIEIFFSNHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 
SEQ ID NO: 12 G CRL 
SEQ ID NO: 14 GSIEIFFSHHCPLWYGFKEKKLKWLERFAYANTTVYPFTSIPLVAYCILPAVCLLTDKFI 
SEQ ID NO: 16 GSIEIFLSRHCPLWYGYNG-KLKPLMRLAYINTIVYPFTSIPLIAYCTLPAFCLLTNKFI 
SEQ ID NO: 18 GSVEIFFSRHCPIWYGYGGG-LKLLERFSYINSVVYPWTSLPLLVYCTLPAICLLTGKFI 
SEQ ID NO: 20 GSVEILFSRHCPLWYGYGG-RLKFLERFAYINTTIYPLTSLPLLVYCILPAICLLTGKFI 
SEQ ID NO: 22 
SEQ ID NO: 23 GSIEILLSRHCPIWYGYHG-RLRLLERIAYINTIVYPITSI PLIAYCILPAFCLITDRFI 
SEQ ID NO: 24 GSVEIFLSRHCPIWYGYGGG-LKWLERFSYINSVVYPWTSLPLIVYCSLPAVCLLTGKFI 
SEQ ID NO: 25 GSVEIFLSRHCPLWYGYGG-KLKWLERLAYINTIVYPFTSIPLLAYCTIPAVCLLTGKFI 
SEQ ID NO: 26 GSVEIFFSRNNAI LASKRLKFLQRLAYLNVGI YPFTSLFLILYCFL PAFSLFSGQFI 
SEQ ID NO: 27 GSVEIFFSRHSPLWYGYKGGKLKWLERFAYANTTIYPFTSI PLLAYCILPAICLLTDKFI 
SEQ ID NO: 28 GSVEILFSRHCPIWYGYSG-RLKWLERFAYVNTTIYPVTAI PLLMYCTLPAVCLLTNKFI 
SEQ ID NO: 29 GSVEIFLSRHCPIWYGYGGG-LKWLERLSYINSVVYPWTSLPLIVYCSLPAICLLTGKFI 

1081 1140 
SEQ ID NO: 2 IPEISNYAGMFFILMFASIFATGILELRWSGVGIEDWWRNEQFWVIGGTSAHLFAVFQGL 
SEQ ID NO: 4 TPELNNVASLWFMSLFICIFATSILEMRWSGVGIDDWWRNEQFWVIGGVSSHLFAVFQGL 
SEQ ID NO: 6 IPTLNNLASIWFIALFLSIIATSVLELRWSGVSIEDWWRNEQFWVIGGVSAHLFAVFQGF 
SEQ ID NO: 8 TPELTNVASIWFMALFICISVTGILEMRWSGVAIDDWWRNEQFWVIGGVSAHLFAVFQGL 
SEQ ID NO: 10 TPELNNVASLWFMSLFICIFATSILEMRWSGVGIDDWWRNEQFWVIGGVSSHLFAVFQGL 
SEQ ID NO: 12 
SEQ ID NO: 14 MPPISTFAGLYFVALFSSIIATGILELKWSGVSIEEWWRNEQFWVIGGVSAHLFAVIQGL 
SEQ ID NO: 16 IPEISNFASMWFILLFVSIFTTSILELRWSGVSIEDWWRNEQFWVIGGTSAHLFAVFQGL 
SEQ ID NO: 18 VPEISNYASLVFMALFISIAATGILEMQWGGVSIDDWWRNEQFWVIGGVSSHLFALFQGL 
SEQ ID NO: 20 MPEISNLASIWFIALFLSIFATGILEMRWSGVGIDEWWRNEQFWVIGGISAHLFAVFQGL 
SEQ ID NO: 22 
SEQ ID NO: 23 IPEISNYASIWFILLFISIAVTGILELRWSGVSIEDWWRNEQFWVIGGTSAKLFAVFQGL 

LKVFAGIDTSFTVTSKAGDD--EE-FSELYTFKWTTLLIPPTTLLLLNFIGVVAGISNAI 
LKVLAGIDTNFTVTSKATDDE-E--FGELYTFKWTTLLIPPTTILIINIVGVVAGISDAI 
LKVLAGIDTNFTVTSKASDEDGD — FAELYVFKWTSLLI PPTT VLIVNLVGIVAGVS YAI 
LKVLAGVNTNFTVTSKAADD — GE- FSELYIFKWTSLLI PPMTLLIMNI VGWVGISDAI 
LKVLAGIDTNFTVTSKANDEEGD — FAELYMFKWTTLLIPPTTILIINMVGVVAGTSYAI 

LKVLAGIDTNFTVTSKATDEDGD--FAELYIFKWTALLIPPTTVLLVNLIGIVAGVSYAV 

LKVLAGVNTNFTVTSKAADD — GA-FSELYI FKWTTLLIPPTTLLIINIIGVIVGVSDAI 

LKVLAGVDTNFTVTAKAADDTE FGELYLFKWTTLLIPPTTLI ILNMVGWAGVSDAI 

LKVIAGIEISFTLTTKSGGDDNEDIYADLYIVKWSSLMIPPI VIAMVNIIAI WAFIRTI 
LKILAGIDTNFTVTSKATDDD-D— FGELYAFKWTTLLIPPTTVLIINIVGVVAGISDAI 
LKVLAGIDTNFTVTSKASDEDGD — FAELYMFKWTTLLI PPTTLLIINLVGWAGISYVI 
LKVLAGVDTNFTVTSKAADD--GE- FSDLYLFKWTSLLI PPMTLLI INVIGVIVGVSDAI 

SEQUENCE LISTING 

<110> E. I. du Pont de Nemours and Company 

<120> Plant Cellulose Synthases 

<130> BB-1170 

<140> 
<141> 

<150> 60/092,844 

<151> July 14, 1998 

<160> 29 

<170> Microsoft Office 97 

<210> 1 

<211> 1221 

<212> DNA 

<213> Hordeum vulgare 



<400> 1 

gcacgaggat attcttactg ggtttaaaat gcacgcaaga ggttggatat caatctactg 60 
catgccacca cgaccttgtr tcaagggttc tgcgccaatc aatctctctg accgtctcaa 120 
tcaagttctc cggtgggctc ttgggtcagt tgaaattctg tttagcagac attgtcctar 180 
ctggtacaat tacggtgggc ggttgaaact tctggagagg atggcttaca tcaacaccat 240 
tgtttatcca ataacatccc ttccacttat cgcctattgt gtgcttcctg ctatctgtct 300 
cctcaccaac aaatttatca ttcccgagat cagraactat gctgggatgt tctttattct 360 
tatgtttgcc tccatctttg ccacgggtat attggagctg cgatggagtg grgtcggcar 420 
cgaggactgg tggagaaacg agcagttctg ggttattggt ggcacatctg cccatctttt 480 
cgcagtgttc cagggtctgc tgaaggtgrt ggccgggatt gacaccaact tcacggttac 540 
ctcgaaggca aacgacgagg atggcgattt tgctgagtta tacgtgttca agtggaccag 600 
tctcctcatt cctccgacca ccgtccttgt gattaacctg gtgggcatgg tggcaggcat 660 
atcatatgcc atcaacagcg gttaccagrc ttggggtcca ctcttcggaa agctct.cci: 720 
ctcaatcrgg gtgatcctcc atctctaccc cttcctcaag ggtctcatgg ggaagcagaa 780 
ccgcacgcca accatcgtca ttgtttggtc catcctccta gcctccatct tctccctcct 840 
gtgggtgaag arcgaccctt tcatatccga tacccagaaa gccgtcgcca tggggcagtg 900 
tggcgtcaac tgctgatcgg cgccgaagag tarctgcccc cctcgtgtaa ataccggagg 960 
gggttggatg ggattttgtt gttgragatg aagacggagt tttatgtaag rtattattgc 1020 
cccttcgtgc tgagaagcac aaaccgtgaa gcctacgaaa cctgcagcgt acattgtgat 1080 
ttttttctcc ttttcttttc atctgtgata cctgttgttt cttcttagag tatattargt 1140 
cagaacgtat ctatagttct atacacacta tgacaccaac tatttatata aggcagctgt 1200 
ngcatcaact cttctgcaaa a 1221 

<210> 2 

<211> 304 

<212> PRT 

<213> Hordeum vulgare 

<400> 2 

His Glu Asp Ile Leu Thr Gly Phe Lys Met His Ala Arg Gly Trp Ile 
1 5 10 15 

Ser Ile Tyr Cys Met Pro Pro Arg Pro Cys Phe Lys Gly Ser Ala Pro 
20 25 30 

Ile Asn Leu Ser Asp Arg Leu Asn Gln Val Leu Arg Trp Ala Leu Gly 
35 40 45 

Ser Val Glu Ile Leu Phe Ser Arg His Cys Pro Ile Trp Tyr Asn Tyr 
50 55 60 

Gly Gly Arg Leu Lys Leu Leu Glu Arg Met Ala Tyr Ile Asn Thr Ile 
65 70 75 80 

Val Tyr Pro Ile Thr Ser Leu Pro Leu Ile Ala Tyr Cys Val Leu Pro 
85 90 95 

Ala Ile Cys Leu Leu Thr Asn Lys Phe Ile Ile Pro Glu Ile Ser Asn 
100 105 110 

Tyr Ala Gly Met Phe Phe Ile Leu Met Phe Ala Ser Ile Phe Ala Thr 
115 120 125 

Gly Ile Leu Glu Leu Arg Trp Ser Gly Val Gly Ile Glu Asp Trp Trp 
130 135 140 

Arg Asn Glu Gln Phe Trp Val Ile Gly Gly Thr Ser Ala His Leu Phe 
145 150 155 160 

Ala Val Phe Gln Gly Leu Leu Lys Val Leu Ala Gly Ile Asp Thr Asn 
165 170 175 

Phe Thr Val Thr Ser Lys Ala Asn Asp Glu Asp Gly Asp Phe Ala Glu 
180 185 190 

Leu Tyr Val Phe Lys Trp Thr Ser Leu Leu Ile Pro Pro Thr Thr Val 
195 200 205 

Leu Val Ile Asn Leu Val Gly Met Val Ala Gly Ile Ser Tyr Ala Ile 
210 215 220 

Asn Ser Gly Tyr Gln Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe 
225 230 235 240 

Ser Ile Trp Val Ile Leu His Leu Tyr Pro Phe Leu Lys Gly Leu Met 
245 250 255 

Gly Lys Gln Asn Arg Thr Pro Thr Ile Val Ile Val Trp Ser Ile Leu 
260 265 270 

Leu Ala Ser Ile Phe Ser Leu Leu Trp Val Lys Ile Asp Pro Phe Ile 
275 280 285 

Ser Asp Thr Gln Lys Ala Val Ala Met Gly Gln Cys Gly Val Asn Cys 
290 295 300 
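
The relationship between SEQ ID NO:1 and SEQ ID NO:2 can be checked by translating the nucleotide sequence in the appropriate reading frame. The sketch below is an illustration only. It uses the standard genetic code and translates the first 60 nucleotides of SEQ ID NO:1, transcribed here from the listing, starting at the second nucleotide. The function name and the use of "X" for codons containing unreadable or ambiguous characters are choices made for this example.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, offset=0):
    """Translate dna from the given 0-based offset; unknown codons give 'X'."""
    dna = dna.upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First 60 nucleotides of SEQ ID NO:1 (positions 1-60).
first_60 = ("gcacgaggat" "attcttactg" "ggtttaaaat"
            "gcacgcaaga" "ggttggatat" "caatctactg")

print(translate(first_60, offset=1))
```

Read from the second nucleotide, these 60 bases encode the first 19 residues of SEQ ID NO:2, His Glu Asp Ile Leu Thr Gly Phe Lys Met His Ala Arg Gly Trp Ile Ser Ile Tyr.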

<210> 3 

<211> 3776 

<212> DNA 

<213> Zea mays 

<400> 3 

gcgcgccgcg caggcgcaac gcaacaaagg gaaaccccag ccggaggagc aaaagctagc 60 

aagcgtgtcc ctccccctcc crcactcccg tttcattcca ttccccccca gacgccgcta 120 

ccgccgccgc cgcacgcacg cttgccccgg gatctggaga tctggtagcg ccagggggat 180 

ggaggccagc gccgggctgg tcgccggctc gcacaaccgg aacgagctcg tcgtcatccg 240 

ccgcgatggc gagccagggc cgaagcccat ggaccagcgg aacggccagg tgtgccagat 300

ttgcggcgac gacgtggggc gcaaccccga cggggagccg ttcgtggcct gcaacgagtg 360

cgccttcccc atctgccggg actgctacga gtacgagcgc cgcgagggca cgcagaactg 420

cccccagtgc aagacccgct tcaagcgcct caaggggtgc gcgcgcgtgc ccggggacga 480

ggaggaggac ggcgtcgacg acctggagaa cgagttcaac tggagcgaca agcacgactc 540

ccagtacctc gccgagtcca tgctccacgc ccacatgagc tacggccgcg gcgccgacct 600

cgacggcgtg ccgcagccat tccaccccat ccccaatgtt cccctcctca ccaacggaca 660

gatggtcgat gacatcccgc cggaccagca cgcccttgtg ccctcgttcg tgggtggcgg 720

ggggaagagg attcaccctc tcccgtacgc ggatcccaac cttcctgtgc aaccgaggtc 780

tatggaccct tccaaggatc tcgccgcata tggctacggg agcgtagcat ggaaggagag 840

gatggagagc tggaagcaga agcaggagag gatgcaccag acgaggaacg atggcggcgg 900

cgatgatggt gatgatgcag atctaccact aatggatgaa gctagacagc cattgtccag 960

aaagatcccg cttccttcaa gccaaatcaa cccctatagg atgattataa taattcggct 1020

agtggttttg tgtttcttct tccactaccg agtgatgcat ccggtgcctg atgcatttgc 1080

tttatggctc atatctgtga tctgtgaaat ttggtttgcc atgtcttgga ttcttgacca 1140

gtttccaaag tggtttccta tcgagaggga aacctatctt gaccggctga gtttaaggtt 1200

tgacaaggaa gggcatcctt ctcaactcgc ccctgttgat ttctttgtca gtacggttga 1260

tcccttgaag gaacctccat tggtcactgc taatactgtt ctatctatcc tttcggtgga 1320

ttatccagtt gataaggttt catgctacgt ttctgatgat ggtgctgcca tgctgacatt 1380

tgaagcattg tctgaaacat ctgaattrgc aaagaaatgg gttccrttct gcaaaagata 1440

tagccttgag cctcgtgctc cagagtggta cttccaacag aagatagact acctgaaaga 1500

caaggtggcg ccaaactttg ttagagaacg gagagcaatg aagagagagt atgaggaatt 1560

caaggtcaga atcaatgcct tggttgctaa agcccaaaag gttcctgagg aaggarggac 1620

aatgcaggat ggaactccat ggcccggaaa taatgtccgt gatcatcctg gaatgattca 1680

ggttttcctt ggtcaaagtg gtggccatga tgtggaagga aatgagcrgc ctcgattggt 1740

ttatgttrca agagaaaaac ggccaggcta caaccatcac aagaaggctg gtgctatgaa 1800

tgcattggtc cgagtctctg ctgtactaac taatgctcct tatttgctga acttggattg 1860

tgatcactat atcaataara gtaaggctat aaaggaagca atgtgtttta tgatggatcc 1920

tttgcttgga aagaaagttt gctatgtgca gtttcctcaa agatttgatg ggattgatcg 1980

ccatgatcga tatgctaaca gaaatgttgt ctttttcgat atcaacatga aaggtttgga 2040

tggtatccag ggcccaattt atgtgggtac tggatgtgtc ttcagaaggc aggcattata 2100

tggctacgat gctcccaaaa caaagaagcc accatcaaga acttgcaact gctggccaaa 2160

gtggtgcatt tgcrgttgct gttttggtaa caggaagacc aagaagaaga ccaagacctc 2220

taaacctaaa tttgagaaga taaagaaact ttttaagaaa aaggaaaatc aagcccctgc 2280

atatgctctt ggtgaaattg atgaagccgc tccaggagct gaaaatgaaa aggctagtat 2340

tgtaaatcaa cagaagttgg aaaagaaatt tggccagtct tcagtttttg trgcatccac 2400

acttcttgag aatggtggaa ccctgaagag tgccagtcca gcttctcttc tgaaggaagc 2460

tatacatgtc atcagttgtg gatatgaaga caaaacaggc tggggaaaag atattggttg 2520

gatttatgga tcagtcacag aagatattct tactgggttt aagatgcact gccatggttg 2580

gcggtcaatt tactgcatac ctaaacgggc cgccttcaaa ggttccgcac ctcrcaatct 2640

ttccgatcgt cttcaccagg ttcttcggtg ggctcttggt tcaattgaaa ttttcrtcag 2700

caaccactgc cctctctggt atgggtatgg tggtggacta aagttcctgg aaaggttttc 2760

gtacattaac tccatcgtat acccttggac atctatcccg ctcrtggcct attgcacatt 2820

gcctgccatc tgcttgctga cagggaaatt tatcacgcca gagcttaaca atgttgccag 2880

cctctggttc atgtcacttt tcatctgcat ttttgctacg agcatcctgg aaatgagatg 2940

gagtggtgta ggcatcgatg actggtggag aaacgagcag ttttgggtca ttggaggcgt 3000

gtcttcacat ctctttgctg tgttccaggg actcctcaag gtcatagctg gtgtagacac 3060

gagcttcact gtgacatcca agggcggaga cgacgaggag ttctcagagc tgtacacatt 3120

caaatggacg acccttctga tacctccgac aaccctgctc ctactgaact tcattggagt 3180

ggtagctggc atctccaatg cgatcaacaa cggatatgaa tcatggggcc ccctgttcgg 3240

gaagctcttc tttgcatttt gggtgatcgt ccatctttac ccgttcctca agggtctggt 3300

tgggaggcag aacaggacgc caacgattgt cattgtctgg tccatcctcc tggcttcgat 3360

cttctcgctg ctttgggtcc ggatcgaccc gttccttgcg aaggatgatg gtcccctgtt 3420

ggaggagtgt ggtctggatt gcaactagga ggtcagcacg tggacttccc cgtcagtgtg 3480

tggtcgaaga agtatttttg cagatgtttt gtgcccatat ttcttttttc aatttttgtc 3540

cctctgtaga tagaaacaag gggagaaggg gaaaaaaagt acttgtattt cttttgttcc 3600

atggtggtgg tggtggtggg cggctcagcc tcgtgagtgc agtattgggc aaaccggagg 3660

ctgcggcaac cttgtgcagt tcggccacga atatactagg gaagatcgcg accaatcaat 3720

caatcgatga ccgagttcaa ttgttcagca aaaaaaaaaa aaaaaaaaaa aaaaaa 3776

<210> 4

<211> 1148 

<212> PRT 

<213> Zea mays 

<400> 4 

Arg Ala Ala Gln Ala Gln Arg Asn Lys Gly Lys Pro Gln Pro Glu Glu
1 5 10 15

Gln Lys Leu Ala Ser Val Ser Leu Pro Leu Pro His Ser Arg Phe Ile
20 25 30

Pro Phe Pro Pro Arg Arg Arg Tyr Arg Arg Arg Arg Thr His Ala Cys
35 40 45

Pro Gly Ile Trp Arg Ser Gly Ser Ala Arg Gly Met Glu Ala Ser Ala
50 55 60

Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu Leu Val Val Ile Arg
65 70 75 80

Arg Asp Gly Glu Pro Gly Pro Lys Pro Met Asp Gln Arg Asn Gly Gln
85 90 95

Val Cys Gln Ile Cys Gly Asp Asp Val Gly Arg Asn Pro Asp Gly Glu
100 105 110

Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro Ile Cys Arg Asp Cys
115 120 125

Tyr Glu Tyr Glu Arg Arg Glu Gly Thr Gln Asn Cys Pro Gln Cys Lys
130 135 140

Thr Arg Phe Lys Arg Leu Lys Gly Cys Ala Arg Val Pro Gly Asp Glu
145 150 155 160

Glu Glu Asp Gly Val Asp Asp Leu Glu Asn Glu Phe Asn Trp Ser Asp
165 170 175

Lys His Asp Ser Gln Tyr Leu Ala Glu Ser Met Leu His Ala His Met
180 185 190

Ser Tyr Gly Arg Gly Ala Asp Leu Asp Gly Val Pro Gln Pro Phe His
195 200 205

Pro Ile Pro Asn Val Pro Leu Leu Thr Asn Gly Gln Met Val Asp Asp
210 215 220

Ile Pro Pro Asp Gln His Ala Leu Val Pro Ser Phe Val Gly Gly Gly
225 230 235 240

Gly Lys Arg Ile His Pro Leu Pro Tyr Ala Asp Pro Asn Leu Pro Val
245 250 255

Gln Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr Gly Tyr
260 265 270

Gly Ser Val Ala Trp Lys Glu Arg Met Glu Ser Trp Lys Gln Lys Gln
275 280 285

Glu Arg Met His Gln Thr Arg Asn Asp Gly Gly Gly Asp Asp Gly Asp
290 295 300

Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gln Pro Leu Ser Arg
305 310 315 320

Lys Ile Pro Leu Pro Ser Ser Gln Ile Asn Pro Tyr Arg Met Ile Ile
325 330 335

Ile Ile Arg Leu Val Val Leu Cys Phe Phe Phe His Tyr Arg Val Met
340 345 350

His Pro Val Pro Asp Ala Phe Ala Leu Trp Leu Ile Ser Val Ile Cys
355 360 365

Glu Ile Trp Phe Ala Met Ser Trp Ile Leu Asp Gln Phe Pro Lys Trp
370 375 380

Phe Pro Ile Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg Phe
385 390 395 400

Asp Lys Glu Gly His Pro Ser Gln Leu Ala Pro Val Asp Phe Phe Val
405 410 415

Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val Thr Ala Asn Thr
420 425 430

Val Leu Ser Ile Leu Ser Val Asp Tyr Pro Val Asp Lys Val Ser Cys
435 440 445

Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu Ala Leu Ser
450 455 460

Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Cys Lys Arg Tyr
465 470 475 480

Ser Leu Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gln Gln Lys Ile Asp
485 490 495

Tyr Leu Lys Asp Lys Val Ala Pro Asn Phe Val Arg Glu Arg Arg Ala
500 505 510

Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg Ile Asn Ala Leu Val
515 520 525

Ala Lys Ala Gln Lys Val Pro Glu Glu Gly Trp Thr Met Gln Asp Gly
530 535 540

Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly Met Ile Gln
545 550 555 560

Val Phe Leu Gly Gln Ser Gly Gly His Asp Val Glu Gly Asn Glu Leu
565 570 575

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Tyr Asn His
580 585 590

His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ser Ala Val
595 600 605

Leu Thr Asn Ala Pro Tyr Leu Leu Asn Leu Asp Cys Asp His Tyr Ile
610 615 620

Asn Asn Ser Lys Ala Ile Lys Glu Ala Met Cys Phe Met Met Asp Pro
625 630 635 640

Leu Leu Gly Lys Lys Val Cys Tyr Val Gln Phe Pro Gln Arg Phe Asp
645 650 655

Gly Ile Asp Arg His Asp Arg Tyr Ala Asn Arg Asn Val Val Phe Phe
660 665 670

Asp Ile Asn Met Lys Gly Leu Asp Gly Ile Gln Gly Pro Ile Tyr Val
675 680 685

Gly Thr Gly Cys Val Phe Arg Arg Gln Ala Leu Tyr Gly Tyr Asp Ala
690 695 700

Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn Cys Trp Pro Lys
705 710 715 720

Trp Cys Ile Cys Cys Cys Cys Phe Gly Asn Arg Lys Thr Lys Lys Lys
725 730 735

Thr Lys Thr Ser Lys Pro Lys Phe Glu Lys Ile Lys Lys Leu Phe Lys
740 745 750

Lys Lys Glu Asn Gln Ala Pro Ala Tyr Ala Leu Gly Glu Ile Asp Glu
755 760 765

Ala Ala Pro Gly Ala Glu Asn Glu Lys Ala Ser Ile Val Asn Gln Gln
770 775 780

Lys Leu Glu Lys Lys Phe Gly Gln Ser Ser Val Phe Val Ala Ser Thr
785 790 795 800

Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala Ser Leu
805 810 815

Leu Lys Glu Ala Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Thr
820 825 830

Gly Trp Gly Lys Asp Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp
835 840 845

Ile Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser Ile Tyr
850 855 860

Cys Ile Pro Lys Arg Ala Ala Phe Lys Gly Ser Ala Pro Leu Asn Leu
865 870 875 880

Ser Asp Arg Leu His Gln Val Leu Arg Trp Ala Leu Gly Ser Ile Glu
885 890 895

Ile Phe Phe Ser Asn His Cys Pro Leu Trp Tyr Gly Tyr Gly Gly Gly
900 905 910

Leu Lys Phe Leu Glu Arg Phe Ser Tyr Ile Asn Ser Ile Val Tyr Pro
915 920 925

Trp Thr Ser Ile Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala Ile Cys
930 935 940

Leu Leu Thr Gly Lys Phe Ile Thr Pro Glu Leu Asn Asn Val Ala Ser
945 950 955 960

Leu Trp Phe Met Ser Leu Phe Ile Cys Ile Phe Ala Thr Ser Ile Leu
965 970 975

Glu Met Arg Trp Ser Gly Val Gly Ile Asp Asp Trp Trp Arg Asn Glu
980 985 990

Gln Phe Trp Val Ile Gly Gly Val Ser Ser His Leu Phe Ala Val Phe
995 1000 1005

Gln Gly Leu Leu Lys Val Ile Ala Gly Val Asp Thr Ser Phe Thr Val
1010 1015 1020

Thr Ser Lys Gly Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr Thr Phe
1025 1030 1035 1040

Lys Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Leu Leu Asn
1045 1050 1055

Phe Ile Gly Val Val Ala Gly Ile Ser Asn Ala Ile Asn Asn Gly Tyr
1060 1065 1070

Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val
1075 1080 1085

Ile Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg Gln Asn
1090 1095 1100

Arg Thr Pro Thr Ile Val Ile Val Trp Ser Ile Leu Leu Ala Ser Ile
1105 1110 1115 1120

Phe Ser Leu Leu Trp Val Arg Ile Asp Pro Phe Leu Ala Lys Asp Asp
1125 1130 1135

Gly Pro Leu Leu Glu Glu Cys Gly Leu Asp Cys Asn
1140 1145

<210> 5 
<211> 1189 
<212> DNA 
<213> Zea mays 

<400> 5 

gcacgagacc gagtggggca aggagattgg gtggatctat gggtcggtga cagaggatat 60 

cctgacgggg ttcaagatgc actgccgggg gtggaagtcc gtgtactgca cgccgacacg 120 

gccggcgttc aaggggtcgg cgcccatcaa cttgtctgat cgtctccacc aggtgctgcg 180 

ctgggcgctg gggtccgtgg agatcttcat gagccgccac tgcccgctct ggtacgccta 240 

cggcggccgg ctcaagtggc tggagcgctt cgcctacacc aacaccatcg tgtacccctt 300 

cacctccatc ccgctcctcg cctactgcac catccccgcc gtctgcctgc tcaccggcaa 360 

gttcatcatt cccacgctga acaacctcgc cagcatctgg ttcatcgcgc rcttcctgtc 420 

catcatcgcg acgagcgtcc tggagctgcg gtggagcggg gtgagcatcg aggactggtg 480 

gcgcaacgag cagttctggg tcatcggcgg cgtgtccgcg catctcttcg ccgtgttcca 540 

gggcttcctc aaggttctgg gcggcgtgga caccagcttc accgtcacct ccaaggcggc 600 

cggcgacgag gccgacgcct tcggggacct ctacctcttc aagtggacca ccctgctggt 660

gccccccacc acgctcatca tcatcaacat ggtgggcatc gtggccggcg tgtccgacgc 720
cgtcaacaac ggctacggct cctggggccc gctcttcggc aagctcttct tctccttctg 780
ggtcatcgtc cacctctacc cgttccccaa ggggctcatg gggaggcaga accggacgcc 840 
caccatcgtc gtgctctggt ccatcctcct cgcctccatc ttctcgctcg tctgggtcag 900 
gatcgacccg tttatcccga aggccaaggg ccccatcctc aagccatgcg gagtcgagtg 960 
ctgagctcac ctagctacct tcttgttgca tgtacggacg ccgccgtgcg tttggacata 1020 
caggcactrt tgggccaggc ractcatgtt cgactttttt tttaattttg tacaagattt 1080 
gtgatcgagt gactgagtga gacagagtgt tgggtgtaag aactgtgatg gaattcactc 1140 
aaattaatgg acattttttt tcttcaactg caaaaaaaaa aaaaaaaaa 1189 

<210> 6 

<211> 320 

<212> PRT 

<213> Zea mays 

<400> 6 

His Glu Thr Glu Trp Gly Lys Glu Ile Gly Trp Ile Tyr Gly Ser Val
1 5 10 15

Thr Glu Asp Ile Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Lys
20 25 30

Ser Val Tyr Cys Thr Pro Thr Arg Pro Ala Phe Lys Gly Ser Ala Pro
35 40 45

Ile Asn Leu Ser Asp Arg Leu His Gln Val Leu Arg Trp Ala Leu Gly
50 55 60

Ser Val Glu Ile Phe Met Ser Arg His Cys Pro Leu Trp Tyr Ala Tyr
65 70 75 80

Gly Gly Arg Leu Lys Trp Leu Glu Arg Phe Ala Tyr Thr Asn Thr Ile
85 90 95

Val Tyr Pro Phe Thr Ser Ile Pro Leu Leu Ala Tyr Cys Thr Ile Pro
100 105 110

Ala Val Cys Leu Leu Thr Gly Lys Phe Ile Ile Pro Thr Leu Asn Asn
115 120 125

Leu Ala Ser Ile Trp Phe Ile Ala Leu Phe Leu Ser Ile Ile Ala Thr
130 135 140

Ser Val Leu Glu Leu Arg Trp Ser Gly Val Ser Ile Glu Asp Trp Trp
145 150 155 160

Arg Asn Glu Gln Phe Trp Val Ile Gly Gly Val Ser Ala His Leu Phe
165 170 175

Ala Val Phe Gln Gly Phe Leu Lys Val Leu Gly Gly Val Asp Thr Ser
180 185 190

Phe Thr Val Thr Ser Lys Ala Ala Gly Asp Glu Ala Asp Ala Phe Gly
195 200 205

Asp Leu Tyr Leu Phe Lys Trp Thr Thr Leu Leu Val Pro Pro Thr Thr
210 215 220

Leu Ile Ile Ile Asn Met Val Gly Ile Val Ala Gly Val Ser Asp Ala
225 230 235 240

Val Asn Asn Gly Tyr Gly Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe
245 250 255

Phe Ser Phe Trp Val Ile Val His Leu Tyr Pro Phe Leu Lys Gly Leu
260 265 270

Met Gly Arg Gln Asn Arg Thr Pro Thr Ile Val Val Leu Trp Ser Ile
275 280 285

Leu Leu Ala Ser Ile Phe Ser Leu Val Trp Val Arg Ile Asp Pro Phe
290 295 300

Ile Pro Lys Ala Lys Gly Pro Ile Leu Lys Pro Cys Gly Val Glu Cys
305 310 315 320



<210> 7 

<211> 3786 

<212> DNA 

<213> Zea mays 



<400> 7 

ccacagctca tataccaaga gccggagcag crtagcgcag cccagagcgg cgccgcgcca 60 
agcacaaccc ccacccgcca cagccgcgtg cgcatgtgag cggtcgccgc ggccgggaga 120 
ccagaggagg ggaggactac gtgcarttcg ctgtgccgcc gccgcggggt tcgrgcgcga 180 
gcgagarccg gcggggcggg gcggggggcc tgagatggag gctagcgcgg ggctagtgac 240 
cggctcgcat aaccggaacg agctggtggt gatccgccgc gaccgcgagt cgggagccgc 300 
gggcggcggc gcggcgcgcc gggcggaggc gccgtgccag atatgcggcg acgaggtcgg 360 
ggtgggcttc gacggggagc ccttcgtggc grgcaacgag tgcgccttcc ccgtctgccg 420 
cgccrgctac gagtacgagc gccgcgaggg ctcgcaagcg tgcccgcagt gcaggacccg 480 
ctacaagcgc ctcaagggct gcccgcgggt ggccggcgac gaggaggagg acggcgtcga 540
cgacctggag ggcgagttcg gcctgcagga cggcgccgcc cacgaggacg acccgcagta 600 
cgtcgccgag tccatgctca gggcgcagat gagctacggc cgcggcggcg acgcgcaccc 660 
cggcttcagc cccgtcccca acgtgccgct cctcaccaac ggccagatgg ttgatgacat 720 
cccgccggag cagcacgcgc tcgtgccgtc ctacatgagc ggcggcggcg gcgggggcaa 780 
gaggatccac ccgctccctt tcgcagatcc caaccttcca gtgcaaccga gatccatgga 840 
cccgtccaag gatctggccg cctacggata rggcagcgtg gcctggaagg agagaatgga 900 
gggctggaag cagaagcagg agcgcctgca gcatgtcagg agcgagggtg gcggtgatrg 960 
ggatggcgac gargcagatc tgccactaat ggatgaagct aggcagccat tgtccagaaa 1020 
agtccctara tcatcaagcc gaattaatcc ctacaggatg artatcgtta tccggttagt 1080
ggttttgggt ttcttcttcc actaccgagt gatgcatccg gcgaaagatg catttgcatt 1140 
gtggctcata tctgtaatct gtgaaatctg gtttgcgatg tcctggactc ttgatcagtt 1200 
cccaaagtgg cttccaatcg agagagagac ttacctggac cgtttgtcac taaggtttga 1260 
caaggaaggt caaccctctc agcttgctcc aatcgacttc tttgtcagta cggttgatcc 1320 
cacaaaggaa ccrcccttgg tcacagcgaa cactgtcctt tccatccttt ctgtggatta 1380 
tccggttgag aaggtctcct gctatgtttc tgatgatggt gctgcaatgc ttacgtttga 1440 
agcattgtct gaaacatctg aatttgcaaa gaaatgggtt cctttcagca aaaagtttaa 1500 
tatcgagcct cgtgctcctg agtggtactt ccaacagaag atagactacc tgaaagacaa 1560 
ggttgctgct tcatttgtta gggagaggag ggcgatgaag agagaacacg aggaattcaa 1620 
ggtaaggatc aatgccttgg ttgcaaaagc ccaaaaggtt cctgaggaag gatggacaat 1680 
gcaagatgga agcccctggc ccggaaacaa cgtacgcgat catcctggaa tgattcaggt 1740 
attccttggc caaagtggcg gtcgtgatgt ggaaggaaar gagttgcctc gcctggttta 1800 
tgtctcgaga gaaaagaggc caggttataa ccatcacaag aaggctggtg ccatgaatgc 1860 
actggtccgt gtctctgctg tcttatcaaa tgctgcatac ctattgaact tggactgtga 1920 
tcactacatc aacaatagca aggccataaa agaggctatg tgttrcatga tggatccttr 1980 
ggtggggaag aaagtgtgct atgtacagtt ccctcagagg tttgatggta ttgacaaaaa 2040 
tgatcgatac gctaacagga acgttgtctr ttrtgacatc aacargaaag atttggacgg 2100 
tattcaagga cccatttatg tgggtactgg atgtgttttc agacggcagg cactgtatgg 2160 
ttatgatgct cctaaaacga agaagccacc atcaagaact tgcaactgct agcccaagrg 2220 
gtgcctctct tgctgctgca gcaggaacaa gaataaaaag aagactacaa aaccaaagac 2280 
ggagaagaag aaaagattat ttttcaagaa agcagaaaac ccatctcctg catatgcttr 2340
gggtgaaatt gatgaaggtg ctccaggtgc tgatatcgag aaggccggaa tcgtaaatca 2400
acagaaacta gagaagaaat ttgggcagtc ttctgrttrt grcgcatcaa cacttcttga 2460
gaacggaggg accctgaaga gcgcaagtcc agcttctctt crgaaggaag ctatacatgt 2520
tatcagctgc ggctacgaag acaagaccga ctggggaaaa gagattggct ggatttacgg 2580
atcgatcaca gaggatatct tgactggatt taagatgcac rgccatggct ggcggtctat 2640
ttactgcatc ccgaagcggc ctgcattcaa aggttctgcg cctctgaacc tttccgaccg 2700
tcttcaccag gtccttcgct gggcccttgg gtccgrcgaa attttcttca gcaagcactg 2760
cccactttgg tacggatacg gcggcgggct aaaattcctg gaaaggtttt cttatatcaa 2820
ctccatcgtt tatccctgga cgtccattcc tctccrggct tactgtacct tgcctgccat 2880
ctgcctgctc acggggaagt ttatcacacc agagcttacc aatgtcgcca gtatctggtt 2940
catggcactt trcatctgca tctccgtgac cggcatcctg gaaatgaggt ggagtggcgt 3000
ggccatcgac gactggtgga ggaacgagca grtctgggtc atcggaggcg tttcggcgca 3060
tctgttcgcg gtgttccagg gcctgctgaa ggtgttcgcc ggcatcgaca cgagcttcac 3120
cgtgacgtcg aaggccgggg acgacgagga gttctcggag ctgtacacgt tcaagtggac 3180
caccctgctg atacccccga ccacgctcct cctgctgaac ttcatcgggg tggtggccgg 3240
gatctcgaac gcgatcaaca acgggtacga gtcgtggggc cccctgttcg ggaagctctt 3300
cttcgccttc tgggtgatcg tccacctgta cccgttcctc aagggtctgg tggggaggca 3360
gaacaggacg ccgacgatcg tcatcgtctg gtccatcctg ctggcctcga tcttctcgct 3420
cctgtgggtc cgcgtcgacc cgttcctcgc caagagcaac ggcccgctcc tggaggagtg 3480
tggccrggac tgcaactgaa gtgggggccc cctgtcactc gaagttcrgt cacgggcgaa 3540
ttacgcctga ttttttgttg ttgttgttgt tggaattctt tgctgtagat agaaaccaca 3600
tgrccacggc atcxctgctg rgtccattgg agcaggagag aggtgccrgc tgctgttrgt 3660
tgagtaaatr aaaagtttra aagttataca gtgatgcaca ttccagrgcc cagtgtattc 3720
cctttttaca atctatatat tagcgacaaa ggacatattg gttaggagtt tgatrcrttt 3780
ataaaa 3786



<210> 8 

<211> 1165 

<212> PRT 

<213> Zea mays 

<400> 8 

His Ser Ser Tyr Thr Lys Ser Arg Ser Ser Leu Ala Gln Pro Arg Ala
1 5 10 15

Ala Pro Arg Gln Ala Gln Pro Pro Pro Ala Thr Ala Ala Cys Ala Cys
20 25 30

Glu Arg Ser Pro Arg Pro Gly Asp Gln Arg Arg Gly Gly Leu Arg Ala
35 40 45

Phe Arg Cys Ala Ala Ala Ala Gly Phe Val Arg Glu Arg Asp Pro Ala
50 55 60

Gly Arg Gly Gly Gly Pro Glu Met Glu Ala Ser Ala Gly Leu Val Ala
65 70 75 80

Gly Ser His Asn Arg Asn Glu Leu Val Val Ile Arg Arg Asp Arg Glu
85 90 95

Ser Gly Ala Ala Gly Gly Gly Ala Ala Arg Arg Ala Glu Ala Pro Cys
100 105 110

Gln Ile Cys Gly Asp Glu Val Gly Val Gly Phe Asp Gly Glu Pro Phe
115 120 125

Val Ala Cys Asn Glu Cys Ala Phe Pro Val Cys Arg Ala Cys Tyr Glu
130 135 140

Tyr Glu Arg Arg Glu Gly Ser Gln Ala Cys Pro Gln Cys Arg Thr Arg
145 150 155 160

Tyr Lys Arg Leu Lys Gly Cys Pro Arg Val Ala Gly Asp Glu Glu Glu
165 170 175

Asp Gly Val Asp Asp Leu Glu Gly Glu Phe Gly Leu Gln Asp Gly Ala
180 185 190

Ala His Glu Asp Asp Pro Gln Tyr Val Ala Glu Ser Met Leu Arg Ala
195 200 205

Gln Met Ser Tyr Gly Arg Gly Gly Asp Ala His Pro Gly Phe Ser Pro
210 215 220

Val Pro Asn Val Pro Leu Leu Thr Asn Gly Gln Met Val Asp Asp Ile
225 230 235 240

Pro Pro Glu Gln His Ala Leu Val Pro Ser Tyr Met Ser Gly Gly Gly
245 250 255

Gly Gly Gly Lys Arg Ile His Pro Leu Pro Phe Ala Asp Pro Asn Leu
260 265 270

Pro Val Gln Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr
275 280 285

Gly Tyr Gly Ser Val Ala Trp Lys Glu Arg Met Glu Gly Trp Lys Gln
290 295 300

Lys Gln Glu Arg Leu Gln His Val Arg Ser Glu Gly Gly Gly Asp Trp
305 310 315 320

Asp Gly Asp Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gln Pro
325 330 335

Leu Ser Arg Lys Val Pro Ile Ser Ser Ser Arg Ile Asn Pro Tyr Arg
340 345 350

Met Ile Ile Val Ile Arg Leu Val Val Leu Gly Phe Phe Phe His Tyr
355 360 365

Arg Val Met His Pro Ala Lys Asp Ala Phe Ala Leu Trp Leu Ile Ser
370 375 380

Val Ile Cys Glu Ile Trp Phe Ala Met Ser Trp Ile Leu Asp Gln Phe
385 390 395 400

Pro Lys Trp Leu Pro Ile Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser
405 410 415

Leu Arg Phe Asp Lys Glu Gly Gln Pro Ser Gln Leu Ala Pro Ile Asp
420 425 430

Phe Phe Val Ser Thr Val Asp Pro Thr Lys Glu Pro Pro Leu Val Thr
435 440 445

Ala Asn Thr Val Leu Ser Ile Leu Ser Val Asp Tyr Pro Val Glu Lys
450 455 460

Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu
465 470 475 480 

Ala Leu Ser Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Ser 
485 490 495 

Lys Lys Phe Asn Ile Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gln Gln
500 505 510

Lys Ile Asp Tyr Leu Lys Asp Lys Val Ala Ala Ser Phe Val Arg Glu
515 520 525

Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg Ile Asn
530 535 540

Ala Leu Val Ala Lys Ala Gln Lys Val Pro Glu Glu Gly Trp Thr Met
545 550 555 560

Gln Asp Gly Ser Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly
565 570 575

Met Ile Gln Val Phe Leu Gly Gln Ser Gly Gly Arg Asp Val Glu Gly
580 585 590

Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly
595 600 605

Tyr Asn His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val
610 615 620

Ser Ala Val Leu Ser Asn Ala Ala Tyr Leu Leu Asn Leu Asp Cys Asp
625 630 635 640

His Tyr Ile Asn Asn Ser Lys Ala Ile Lys Glu Ala Met Cys Phe Met
645 650 655

Met Asp Pro Leu Val Gly Lys Lys Val Cys Tyr Val Gln Phe Pro Gln
660 665 670

Arg Phe Asp Gly Ile Asp Lys Asn Asp Arg Tyr Ala Asn Arg Asn Val
675 680 685

Val Phe Phe Asp Ile Asn Met Lys Gly Leu Asp Gly Ile Gln Gly Pro
690 695 700

Ile Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gln Ala Leu Tyr Gly
705 710 715 720

Tyr Asp Ala Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn Cys
725 730 735

Trp Pro Lys Trp Cys Leu Ser Cys Cys Cys Ser Arg Asn Lys Asn Lys
740 745 750

Lys Lys Thr Thr Lys Pro Lys Thr Glu Lys Lys Lys Arg Leu Phe Phe
755 760 765

Lys Lys Ala Glu Asn Pro Ser Pro Ala Tyr Ala Leu Gly Glu Ile Asp
770 775 780

Glu Gly Ala Pro Gly Ala Asp Ile Glu Lys Ala Gly Ile Val Asn Gln
785 790 795 800

Gln Lys Leu Glu Lys Lys Phe Gly Gln Ser Ser Val Phe Val Ala Ser
805 810 815

Thr Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala Ser
820 825 830

Leu Leu Lys Glu Ala Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys
835 840 845

Thr Asp Trp Gly Lys Glu Ile Gly Trp Ile Tyr Gly Ser Ile Thr Glu
850 855 860

Asp Ile Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser Ile
865 870 875 880

Tyr Cys Ile Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro Leu Asn
885 890 895

Leu Ser Asp Arg Leu His Gln Val Leu Arg Trp Ala Leu Gly Ser Val
900 905 910

Glu Ile Phe Phe Ser Lys His Cys Pro Leu Trp Tyr Gly Tyr Gly Gly
915 920 925

Gly Leu Lys Phe Leu Glu Arg Phe Ser Tyr Ile Asn Ser Ile Val Tyr
930 935 940

Pro Trp Thr Ser Ile Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala Ile
945 950 955 960

Cys Leu Leu Thr Gly Lys Phe Ile Thr Pro Glu Leu Thr Asn Val Ala
965 970 975

Ser Ile Trp Phe Met Ala Leu Phe Ile Cys Ile Ser Val Thr Gly Ile
980 985 990

Leu Glu Met Arg Trp Ser Gly Val Ala Ile Asp Asp Trp Trp Arg Asn
995 1000 1005

Glu Gln Phe Trp Val Ile Gly Gly Val Ser Ala His Leu Phe Ala Val
1010 1015 1020

Phe Gln Gly Leu Leu Lys Val Phe Ala Gly Ile Asp Thr Ser Phe Thr
1025 1030 1035 1040

Val Thr Ser Lys Ala Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr Thr
1045 1050 1055

Phe Lys Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Leu Leu
1060 1065 1070

Asn Phe Ile Gly Val Val Ala Gly Ile Ser Asn Ala Ile Asn Asn Gly
1075 1080 1085

Tyr Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp
1090 1095 1100

Val Ile Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg Gln
1105 1110 1115 1120

Asn Arg Thr Pro Thr Ile Val Ile Val Trp Ser Ile Leu Leu Ala Ser
1125 1130 1135

Ile Phe Ser Leu Leu Trp Val Arg Val Asp Pro Phe Leu Ala Lys Ser
1140 1145 1150

Asn Gly Pro Leu Leu Glu Glu Cys Gly Leu Asp Cys Asn
1155 1160 1165



<210> 9 

<211> 3936 

<212> DNA 

<213> Zea mays 



<400> 9 

cttctccctc gtcggtgcgg cgtggcgcgg ctcggcgttc ggtgagaaac cactcggggg 60
atgaggatct gctgctagag tgagaggagc tacggtcagt atcctctgcc ttcgtcggcg 120
gcggaagtgg aggggaggaa gcgatggagg cgagcgccgg gctggrggcc ggctcccaca 180
accgcaacga gctcgtcgrc arccgccgcg acggcgatcc cgggccgaag ccgccgcggg 240
agcagaacgg gcaggtgtgc cagatrtgcg gcgacgacgt cggccttgcc cccggcgggg 300
accccttcgt ggcgtgcaac gagtgcgcct tccccgtctg ccgggactgc tacgaatacg 360
agcgccggga gggcacgcag aactgccccc agtgcaagac tcgatacaag cgcctcaagg 420
gctgccaacg tgtgaccggt gacgaggagg aggacggcgt cgatgaccrg gacaacgagt 480
tcaactggga cggccatgac tcgcagtctg tggccgagtc catgctctac ggccacatga 540
gctacggccg tggaggtgac cctaatggcg cgccacaagc tttccagctc aaccccaatg 600
ttccactcct caccaacggg caaatggtgg atgacatccc accggagcag cacgcgctgg 660
tgccttcttt catgggtggt gggggaaaga ggatacatcc ccttccttat gcggatccca 720
gcttacctgt gcaacccagg tctatggacc catccaagga tcttgctgca tatgggtatg 780
gtagtgttgc ttggaaggaa cggatggaga attggaagca gagacaagag aggatgcacc 840
agacggggaa tgatggtggt ggtgatgatg gtgacgatgc tgatctacca ctaatggatg 900
aagcaagaca acaactgtcc aggaaaattc cacttccatc aagccagatt aatccatata 960
ggatgattat cattattcgg cttgtggttt tggggttctt cttccactac cgagtgatgc 1020
atccggtgaa tgatgcattt gctttgtggc tcatatctgt tatctgtgaa atctggtttg 1080
ccatgtcttg gattcttgat caattcccaa agtggctccc tattgagaga gagacttacc 1140
tagaccggct gtcactgagg ttcgacaagg aaggccagcc atcrcaactt gctccaattg 1200
atttctttgt cagcacggtc garcccrtaa aggaacctcc rttggrcaca acaaatactg 1260
ttctatctat cctttcggtg gattatcctg ttgataaggt ttcttgctat gtttctgatg 1320
atggtgctgc aatgctaacg tttgaagcat tatctgaaac atctgaattt gcaaagaaat 1380
gggttccttt ctgcaaacgg tacaatattg aacctcgcgc tccagagtgg tacttccaac 1440
agaagataga ctacttgaaa gacaaggtgg cagcaaactt tgttagggag aggagagcaa 1500
tgaagagaga gtatgaggaa ttcaaggtga gaatcaargc cttagttgcc aaagcccaga 1560
aagttcctga agaaggatgg acaatgcaag atggaacccc ctggcctgga aacaatgttc 1620
gtgatcatcc tggaatgatt caggtcttcc ttggccaaag cggaggccrt gactgtgagg 1680
gaaatgaact gccacgattg gtttatgttt ctagagagaa acgaccaggc tataaccatc 1740
ataagaaagc tggtgctatg aatgcattgg tccgagtctc tgctgtacta acaaatgctc 1800
catatttgtt aaacttggat tgtgarcact acatcaacaa cagcaaggct ataaaggaag 1860
caatgtgttt tatgatggac ccxttactag gaaagaaggt ttgctatgta cagttccctc 1920
aaagatttga tgggattgat cgccatgacc gatatgctaa ccggaatgtt gtcttttttg 1980
atatcaacat gaaaggtttg gatggtattc agggtccaat ttatgttggt actggatgtg 2040
tatttagaag gcaggcatta tatggttatg atgcccccaa aacaaagaag ccaccatcaa 2100
ggacttgcaa ctgctggccc aagtggtgct tttgctgttg ctgctttggc aataggaagc 2160
aaaagaagac taccaaaccc aaaacagaga agaaaaagtt actatttttc aagaaagaag 2220
agaaccaatc ccctgcatat gctcttggtg aaattgacga agctgctcca ggagctgaga 2280
atgaaaaggc cggtartgta aatcaacaaa aattagaaaa gaaatttggc caatcttctg 2340
tttttgttac atccacactt ctcgagaatg gtggaacctt gaagagtgca agtcctgctt 2400
ctcttttgaa agaagctata catgtcatta gttgtggtta tgaagacaag acagactggg 2460
gaaaagagat tggctggarc tatggatcag ttacagaaga tattctaact ggtttcaaga 2520
tgcattgtca tggttggcgg tcaatttact gcatacctaa acgggttgca ttcaaaggtt 2580
ctgcacctct gaatctttca gatcgtcttc accaggtgct tcggtgggct cttgggtcta 2640
ttgagatctt cttcagcaat cattgccctc ttrggtatgg gtatggtggc ggtctgaaat 2700
ttttggaaag attttcctac atcaactcca tcgtgtatcc ttggacatct attcccctct 2760
tggcttactg tacattgcct gccatctgtt tattgacagg gaaatttatc actccagagc 2820
tgaataatgt tgccagcctg tggttcatgt cactttttat ctgcattttt gctacgagca 2880
tcctagaaat gagatggagt ggtgttggaa ttgatgactg gtggaggaat gagcagttct 2940
gggtcattgg aggtgtgtcc tcacacctct ttgctgtgtt ccagggactt ctcaaggtca 3000
tagctggtgt tgaracaagc ttcaccgtga catcaaaggg tggagatgat gaggagttct 3060
cagagctata tacattcaaa tggactacct tattgatacc tcctaccacc ttgcttctat 3120
tgaacttcat tggtgtggtc gctggcgttt caaatgcgat caataacgga tatgagtcat 3180
ggggccccct ctttgggaag ctattctttg cattttgggt gattgtccat ctttatccct 3240
ttctcaaagg tttggttgga aggcaaaaca ggacaccaac gattgtcatc gtctggtcca 3300
ttctgctggc ttcaatcttc tcgctccttt gggttcggat tgatcctttc cttgcgaagg 3360
atgatggtcc gcttcttgag gagtgtggtt tggattgcaa ctaggatgtc agtgcatcag 3420
ctcccccaat ctgcatatgc ttgaagtata ttttctggtg tttgtcccca tattcagtgt 3480
ctgtagataa gagacatgaa atgtcccaag tttctttrga tccatggtga acctacttaa 3540
tatctgagag ataractggg ggaaaatgga ggctgcggca atccttgtgc agttgggccg 3600
tggaatacag catatgcaag tgtttgattg tgcagcattc tttattactt ggtcgcaata 3660
tagatgggct gagccgaaca gcaaggtatt ttgattctgc actgctcccg tgtacaaact 3720
tggttctcaa taaggcaggc aggaatgcat crgccagtgg aacagagcaa cctgcacatt 3780
atttatgtat gcctgttcat tggagggctt gtrcattaca tgttcgtcta tactagaaaa 3840
aacagaatat tagcattaat ctatagttaa traaagtatg caaatgcgcc rgttttttgt 3900
tgtgtactgt aatcatctga gttggttttg tgaaaa 3936



<210> 10 

<211> 1086 

<212> PRT 

<213> Zea mays 

<400> 10 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu
1 5 10 15

Leu Val Val Ile Arg Arg Asp Gly Asp Pro Gly Pro Lys Pro Pro Arg
20 25 30

Glu Gln Asn Gly Gln Val Cys Gln Ile Cys Gly Asp Asp Val Gly Leu
35 40 45

Ala Pro Gly Gly Asp Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro
50 55 60

Val Cys Arg Asp Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Thr Gln Asn
65 70 75 80

Cys Pro Gln Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Cys Gln Arg
85 90 95

Val Thr Gly Asp Glu Glu Glu Asp Gly Val Asp Asp Leu Asp Asn Glu
100 105 110

Phe Asn Trp Asp Gly His Asp Ser Gln Ser Val Ala Glu Ser Met Leu
115 120 125

Tyr Gly His Met Ser Tyr Gly Arg Gly Gly Asp Pro Asn Gly Ala Pro
130 135 140

Gln Ala Phe Gln Leu Asn Pro Asn Val Pro Leu Leu Thr Asn Gly Gln
145 150 155 160

Met Val Asp Asp Ile Pro Pro Glu Gln His Ala Leu Val Pro Ser Phe
165 170 175

Met Gly Gly Gly Gly Lys Arg Ile His Pro Leu Pro Tyr Ala Asp Pro
180 185 190

Ser Leu Pro Val Gln Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala
195 200 205

Ala Tyr Gly Tyr Gly Ser Val Ala Trp Lys Glu Arg Met Glu Asn Trp
210 215 220

Lys Gln Arg Gln Glu Arg Met His Gln Thr Gly Asn Asp Gly Gly Gly
225 230 235 240

Asp Asp Gly Asp Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gln
245 250 255

Gln Leu Ser Arg Lys Ile Pro Leu Pro Ser Ser Gln Ile Asn Pro Tyr
260 265 270

Arg Met Ile Ile Ile Ile Arg Leu Val Val Leu Gly Phe Phe Phe His
275 280 285

Tyr Arg Val Met His Pro Val Asn Asp Ala Phe Ala Leu Trp Leu Ile
290 295 300

Ser Val Ile Cys Glu Ile Trp Phe Ala Met Ser Trp Ile Leu Asp Gln
305 310 315 320

Phe Pro Lys Trp Phe Pro Ile Glu Arg Glu Thr Tyr Leu Asp Arg Leu
325 330 335

Ser Leu Arg Phe Asp Lys Glu Gly Gln Pro Ser Gln Leu Ala Pro Ile
340 345 350

Asp Phe Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val
355 360 365

Thr Thr Asn Thr Val Leu Ser Ile Leu Ser Val Asp Tyr Pro Val Asp
370 375 380

Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe
385 390 395 400

Glu Ala Leu Ser Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe
405 410 415

Cys Lys Arg Tyr Asn Ile Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gln
420 425 430

Gln Lys Ile Asp Tyr Leu Lys Asp Lys Val Ala Ala Asn Phe Val Arg
435 440 445

Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg Ile
450 455 460

Asn Ala Leu Val Ala Lys Ala Gln Lys Val Pro Glu Glu Gly Trp Thr
465 470 475 480

Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro
485 490 495

Gly Met Ile Gln Val Phe Leu Gly Gln Ser Gly Gly Leu Asp Cys Glu
500 505 510

Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro
515 520 525

Gly Tyr Asn His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg
530 535 540

Val Ser Ala Val Leu Thr Asn Ala Pro Tyr Leu Leu Asn Leu Asp Cys
545 550 555 560

Asp His Tyr Ile Asn Asn Ser Lys Ala Ile Lys Glu Ala Met Cys Phe
565 570 575

Met Met Asp Pro Leu Leu Gly Lys Lys Val Cys Tyr Val Gln Phe Pro
580 585 590

Gln Arg Phe Asp Gly Ile Asp Arg His Asp Arg Tyr Ala Asn Arg Asn
595 600 605

Val Val Phe Phe Asp Ile Asn Met Lys Gly Leu Asp Gly Ile Gln Gly
610 615 620

Pro Ile Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gln Ala Leu Tyr
625 630 635 640

Gly Tyr Asp Ala Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn
645 650 655

Cys Trp Pro Lys Trp Cys Phe Cys Cys Cys Cys Phe Gly Asn Arg Lys
660 665 670

Gln Lys Lys Thr Thr Lys Pro Lys Thr Glu Lys Lys Lys Leu Leu Phe
675 680 685

Phe Lys Lys Glu Glu Asn Gln Ser Pro Ala Tyr Ala Leu Gly Glu Ile
690 695 700

Asp Glu Ala Ala Pro Gly Ala Glu Asn Glu Lys Ala Gly Ile Val Asn
705 710 715 720

Gln Gln Lys Leu Glu Lys Lys Phe Gly Gln Ser Ser Val Phe Val Thr
725 730 735

Ser Thr Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala
740 745 750

Ser Leu Leu Lys Glu Ala Ile His Val Ile Ser Cys Gly Tyr Glu Asp
755 760 765

Lys Thr Asp Trp Gly Lys Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr
770 775 780

Glu Asp Ile Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser
785 790 795 800






Ile Tyr Cys Ile Pro Lys Arg Val Ala Phe Lys Gly Ser Ala Pro Leu
805 810 815

Asn Leu Ser Asp Arg Leu His Gln Val Leu Arg Trp Ala Leu Gly Ser
820 825 830

Ile Glu Ile Phe Phe Ser Asn His Cys Pro Leu Trp Tyr Gly Tyr Gly
835 840 845

Gly Gly Leu Lys Phe Leu Glu Arg Phe Ser Tyr Ile Asn Ser Ile Val
850 855 860

Tyr Pro Trp Thr Ser Ile Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala
865 870 875 880

Ile Cys Leu Leu Thr Gly Lys Phe Ile Thr Pro Glu Leu Asn Asn Val
885 890 895

Ala Ser Leu Trp Phe Met Ser Leu Phe Ile Cys Ile Phe Ala Thr Ser
900 905 910

Ile Leu Glu Met Arg Trp Ser Gly Val Gly Ile Asp Asp Trp Trp Arg
915 920 925

Asn Glu Gln Phe Trp Val Ile Gly Gly Val Ser Ser His Leu Phe Ala
930 935 940

Val Phe Gln Gly Leu Leu Lys Val Ile Ala Gly Val Asp Thr Ser Phe
945 950 955 960

Thr Val Thr Ser Lys Gly Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr
965 970 975

Thr Phe Lys Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Leu
980 985 990

Leu Asn Phe Ile Gly Val Val Ala Gly Val Ser Asn Ala Ile Asn Asn
995 1000 1005

Gly Tyr Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe
1010 1015 1020

Trp Val Ile Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg
1025 1030 1035 1040

Gln Asn Arg Thr Pro Thr Ile Val Ile Val Trp Ser Ile Leu Leu Ala
1045 1050 1055

Ser Ile Phe Ser Leu Leu Trp Val Arg Ile Asp Pro Phe Leu Ala Lys
1060 1065 1070

Asp Asp Gly Pro Leu Leu Glu Glu Cys Gly Leu Asp Cys Asn
1075 1080 1085

<210> 11

<211> 1138

<212> DNA

<213> Oryza sativa

<400> 11

cgctgctccc ggcgatggac gtgttcgrca ccaccgccga ccccgacaag gagccgccgc 60

tcgccacggc gaacaccgtg ctgtccatat atcctcgccg cgggctaccc cgccggcaag 120

gtgacgtgct atatttccga cgacgcaggc gcggaggtga cacgtaacgc ggtcgtggag 180

gcggcccggt tcgcggcgct ttgggtgtcg ttctgccgga agcacggcgt cgagccgagg 240

aacctggagg cgtacttcaa cgccggcgag ggtggtggtg gcaaggcgaa ggtggtggcg 300

agggggagct acagggggat ggcgtggccg gagctggrgc gcgacaggag acgggtgcgc 360

cgcgagtacg aggagargcg gctgcggatc gacgcgctgc aggccgccga tgcgcgccgc 420

cggcgccgcg gcgcggccga tgaccacgcc ggagttgtgc aggtactgat cgattttgct 480

gggagcgtgc cacagctcgg cgttgcgaac gggagcaagc tcatcgacgt cgcctctgtc 540

gacgtgtgcc tcccggcgct tgtgtacgtg tgccgcgaga agcgccgcgg ccacgcgcac 600

caccggaagg cgggcgccat gaacgcgccc ttcatcctcg acctcgactg cgactactac 660

gtcaacaact cgcaggccct ccgcgccggc atctgcttca tgatcgaacg cggcggcggc 720

ggagccgccg aagacgccgg cgcggtcgcg ttcgtccagt tcccgcagcg ggtcgacggc 780

gtcgatcccg gcgaccgcta cgccaaccac aaccgcgtcc tcttcgactg caccgagctc 840

ggcctcgacg gcctccaggg ccccatctac gtcggcaccg gctgcttgtt ccgccgtgtc 900

gcgctctaca gcgtcgacct gccgcgctgg agaccgcggc gttcattggg ctgtcgccta 960

ctcggagaag acgagcggct atggtccagg atgaaacaaa tggtaatatt aagtggtcca 1020

aggtgaaaaa ctcagctaaa acctgaccca agctgtaaca tgggtaaaaa tatatggccc 1080

aaaatgaaat ttacrttrtr ttttttacca aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1138



<210> 12 

<211> 341 

<212> PRT 

<213> Oryza sativa 

<400> 12 

Arg Cys Ser Arg Arg Trp Thr Cys Ser Ser Pro Pro Pro Thr Pro Thr
1 5 10 15

Arg Ser Arg Arg Ser Pro Arg Arg Thr Pro Cys Cys Pro Tyr Ile Leu
20 25 30

Ala Ala Gly Tyr Pro Ala Gly Lys Val Thr Cys Tyr Ile Ser Asp Asp
35 40 45

Ala Gly Ala Glu Val Thr Arg Asn Ala Val Val Glu Ala Ala Arg Phe
50 55 60

Ala Ala Leu Trp Val Ser Phe Cys Arg Lys His Gly Val Glu Pro Arg
65 70 75 80

Asn Leu Glu Ala Tyr Phe Asn Ala Gly Glu Gly Gly Gly Gly Lys Ala
85 90 95

Lys Val Val Ala Arg Gly Ser Tyr Arg Gly Met Ala Trp Pro Glu Leu
100 105 110

Val Arg Asp Arg Arg Arg Val Arg Arg Glu Tyr Glu Glu Met Arg Leu
115 120 125

Arg Ile Asp Ala Leu Gln Ala Ala Asp Ala Arg Arg Arg Arg Arg Gly
130 135 140

Ala Ala Asp Asp His Ala Gly Val Val Gln Val Leu Ile Asp Phe Ala
145 150 155 160

Gly Ser Val Pro Gln Leu Gly Val Ala Asn Gly Ser Lys Leu Ile Asp
165 170 175

Val Ala Ser Val Asp Val Cys Leu Pro Ala Leu Val Tyr Val Cys Arg
180 185 190

Glu Lys Arg Arg Gly His Ala His His Arg Lys Ala Gly Ala Met Asn
195 200 205

Ala Pro Phe Ile Leu Asp Leu Asp Cys Asp Tyr Tyr Val Asn Asn Ser
210 215 220

Gln Ala Leu Arg Ala Gly Ile Cys Phe Met Ile Glu Arg Gly Gly Gly
225 230 235 240

Gly Ala Ala Glu Asp Ala Gly Ala Val Ala Phe Val Gln Phe Pro Gln
245 250 255

Arg Val Asp Gly Val Asp Pro Gly Asp Arg Tyr Ala Asn His Asn Arg
260 265 270

Val Leu Phe Asp Cys Thr Glu Leu Gly Leu Asp Gly Leu Gln Gly Pro
275 280 285

Ile Tyr Val Gly Thr Gly Cys Leu Phe Arg Arg Val Ala Leu Tyr Ser
290 295 300

Val Asp Leu Pro Arg Trp Arg Pro Arg Arg Ser Leu Gly Cys Arg Leu
305 310 315 320

Leu Gly Glu Asp Glu Arg Leu Trp Ser Arg Met Lys Gln Met Val Ile
325 330 335

Leu Ser Gly Pro Arg
340



<210> 13 

<211> 3517 

<212> DNA 

<213> Glycine max 



<400> 13 

gcacgagcca acaacaacac ccttatgtgg acacatiagg tgaggttcaa cagctagcac 60

caatcttcct tcataaaaca caaacctttg atcacacaat ctcaccttaa tttgtgttgt 120

tgttgtgcca ttcccatatr gtcccattca ctaagacatg gaagccagcg ctggactggt 180

cgctgggtca cataaccgca atgagctagt tgtcattcat ggccatgaag agccgaaggc 240

tttgaagaac ttggatgggc aagtgtgtga gatttgtggt gatggcgtgg gactcacggt 300

ggatggagac ttgtttgtgg cttgcaatga gtgtggtttt ccagtgtgca ggccttgcta 360

tgagtatgaa aggagagaag gaagccacct ttgcccacag tgcaaaacca gatacaagcg 420

tctcaaaggg agcccccgag tggagggaga tgatgatgaa gaggatgtgg atgatattga 480

gcatgaattc aatattgatg agcaaaagaa caagcatggc caggttgcag aagccatgct 540

tcatgggagg atgagctatg gaagaggtcc tgaagatgat gacaattccc agrtcccaac 600

acctgtcatt gctggtggtc gttctaggcc tgtaagtggg gagttcccaa tatcatctaa 660

tgcttatggg garcagatgt tatcctcttc actgcataaa agagtgcatc catatccagt 720

gtctgaacct ggaagtgcaa gatgggacga aaaaaaaaga agatggatgg aaagatagaa 780

tggatgactg gaaattgcag caaggcaatt tggggcctga accggatgaa gatccagatg 840

cagccatgtt agatgaagca aggcaaccac tgtcaaggaa agtgccaata gcatccagca 900

aaatcaatcc atatagaatg gtgattgtgg cacgtctggt tattcttgct ttcttcctca 960

gatacagact catgaaccca gtacatgatg ccctggggct atggctaacc tctatcatat 1020

gtgaaatctg gtttgctttt tcatggattc tggatcagtt tcccaaatgg tttcccattg 1080

atagagagac ctaccttgac cgtctttcca tcaggtatga gcgtgaaggt gaaccaaaca 1140

tgcttgctcc tgtagatgtt tttgttagta ccgtggatcc catgaaggaa cctcctctgg 1200

ttacagcaaa cactgttctt tcaatcttgg ccatggatta cccggttgat aaaatatcat 1260

gctacatttc tgatgatgga gcctcaatgt gtacatttga gtccttatca gaaactgcag 1320

agtttgctag aaagtgggta ccgttttgta agaaattttc catagaacct cgggcacctg 1380

agatgtactt cagcgagaag attgactacc taaaggacaa agtgcaaccc acctttgtta 1440

aggagcgtcg agctatgaag agggaaracg aagagrttaa ggttaggatc aatgcacttg 1500

ttgctaaggc ccagaaagtt cctcagggag gatggatcat gcaggatggg acaccatggc 1560

cagggaataa cactaaggat catcctggta tgattcaagt gtttcttggt agcagtggag 1620

gtcttgatac tgaaggaaac caacttccrc gccttgttta rgtttccaga gagaaaaggc 1680

ctggttttca acaccacaag aaagctggtg ccatgaatgc rctggttcgg gtatctgctg 1740

ttctcacaaa tgctcctttc atgttgaact tggattgtga rcactatgtc aataacagca 1800

aggctgcccg agaggccatg cgcttcttga tggacccaca aactgggaag aaggtctgct 1860

atgtccagtt tcctcaaaga tttgatggta ttgatacaca tgatcgttat gccaacagga 1920

acacagtttt ctttgatatt aacatgaagg gtctagatgg nartcaaggt cctgtatatg 1980

tggggactgg atgtgttttc aggaggcaag ctttgtatgg ctataatcct cccaagggtc 2040

caaagcgtcc aaaaatggta agctgtgatt gttgcccgtg ttttggaagc cgcaagaagt 2100

ataaggagaa gaatgatgca aatggagagg ctgcaagcct aaaagggatg gatgatgaca 2160

aagaggtgtt gatgtcccaa atgaattttg agaagaaatt tggacaatcc tctatttttg 2220

tgacttctac cttgatggaa gagggtggtg tgcctccttc ttcaagtcca gctgccctgc 2280

ttaaagaagc cattcargtg attagctgtg garatgaaga taaaactgaa tggggacttg 2340

agcttggttg gatctatgga tctatcacag aagatattct aacaggtttt aagatgcatt 2400

gccgtgggtg gaggtccatt tattgtatgc caaagagagc rgcattcaag ggractgcrc 2460

ctatcaactt gtcagatcgt ctcaaccagg ttcttcgttg ggcacttggt tccattgaga 2520

ttttctrtag tcaccattgc cctctatggt atggcttcaa ggaaaagaag ctaaagtggc 2580

ttgagagatt tgcctatgca aacacaacrg tctatccait cacctccatt cctctagttg 2640

cctactgtat tcttccagca gtttgtttac tcactgacaa artcatcatg ccaccgatta 2700

gcacctttgc tggtttgtac tttgttgctc tcttctcctc aatcattgca accggtattc 2760

ttgagrtgaa atggagtgga gtgagcattg aggaatggtg gagaaatgag cagttttggg 2820

tcattggtgg tgtatcagct cacctctttg ctgttataca aggtctgcta aaggttctgg 2880

ctggaattga caccaatttc actgttacat caaaggcaac agatgatgaa gagtttggag 2940

aattgtacac ctttaagtgg actacactct tgattcctcc aaccactatt ttgatcatta 3000

acattgttgg tgttgttgct ggaatctcag atgccataaa caatgggtac caatcctggg 3060

gaccactctt tggaaagctc ttcttttcct tcrgggtgat tgtccatctc tatccattcc 3120

ttaaaggttt gatgggtcgc caaaatcgca cacccaccat tgttgtgatt tggtcagtgc 3180

tattggcctc tattttctcc ttactttggg taagaattga tccatttgrc ctcaagacta 3240

agggacctga taccaagcta tgtggaatca actgctaaaa aagactgctt tccctatagt 3300

attattcttt aaaagatgta tgtagggtac atacattctt ggtttcacaa accaacaaag 3360

tggcaatgca caaggatcaa taaggaaaga gtgaaaattt -gtgtatcat aaatgagtgt 3420

tatcattttt gtaaatgttc tcaaggacat ctgttttggt rggaactgcc caaaaattgc 3480

agttttatct attcactgga aaaaaaaaaa aaaaaaa 3517

<210> 14

<211> 1039

<212> PRT

<213> Glycine max



<220> 

<221> UNSURE 
<222> (201) 

<400> 14 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu
1 5 10 15

Leu Val Val Ile His Gly His Glu Glu Pro Lys Ala Leu Lys Asn Leu
20 25 30

Asp Gly Gln Val Cys Glu Ile Cys Gly Asp Gly Val Gly Leu Thr Val
35 40 45

Asp Gly Asp Leu Phe Val Ala Cys Asn Glu Cys Gly Phe Pro Val Cys
50 55 60

Arg Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Ser His Leu Cys Pro
65 70 75 80

Gln Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Ser Pro Arg Val Glu
85 90 95

Gly Asp Asp Asp Glu Glu Asp Val Asp Asp Ile Glu His Glu Phe Asn
100 105 110

Ile Asp Glu Gln Lys Asn Lys His Gly Gln Val Ala Glu Ala Met Leu
115 120 125

His Gly Arg Met Ser Tyr Gly Arg Gly Pro Glu Asp Asp Asp Asn Ser
130 135 140

Gln Phe Pro Thr Pro Val Ile Ala Gly Gly Arg Ser Arg Pro Val Ser
145 150 155 160

Gly Glu Phe Pro Ile Ser Ser Asn Ala Tyr Gly Asp Gln Met Leu Ser
165 170 175

Ser Ser Leu His Lys Arg Val His Pro Tyr Pro Val Ser Glu Pro Gly
180 185 190

Ser Ala Arg Trp Asp Glu Lys Lys Xaa Asp Gly Trp Lys Asp Arg Met
195 200 205

Asp Asp Trp Lys Leu Gln Gln Gly Asn Leu Gly Pro Glu Pro Asp Glu
210 215 220

Asp Pro Asp Ala Ala Met Leu Asp Glu Ala Arg Gln Pro Leu Ser Arg
225 230 235 240

Lys Val Pro Ile Ala Ser Ser Lys Ile Asn Pro Tyr Arg Met Val Ile
245 250 255

Val Ala Arg Leu Val Ile Leu Ala Phe Phe Leu Arg Tyr Arg Leu Met
260 265 270

Asn Pro Val His Asp Ala Leu Gly Leu Trp Leu Thr Ser Ile Ile Cys
275 280 285

Glu Ile Trp Phe Ala Phe Ser Trp Ile Leu Asp Gln Phe Pro Lys Trp
290 295 300

Phe Pro Ile Asp Arg Glu Thr Tyr Leu Asp Arg Leu Ser Ile Arg Tyr
305 310 315 320

Glu Arg Glu Gly Glu Pro Asn Met Leu Ala Pro Val Asp Val Phe Val
325 330 335

Ser Thr Val Asp Pro Met Lys Glu Pro Pro Leu Val Thr Ala Asn Thr
340 345 350

Val Leu Ser Ile Leu Ala Met Asp Tyr Pro Val Asp Lys Ile Ser Cys
355 360 365

Tyr Ile Ser Asp Asp Gly Ala Ser Met Cys Thr Phe Glu Ser Leu Ser
370 375 380

Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe Cys Lys Lys Phe
385 390 395 400

Ser Ile Glu Pro Arg Ala Pro Glu Met Tyr Phe Ser Glu Lys Ile Asp
405 410 415

Tyr Leu Lys Asp Lys Val Gln Pro Thr Phe Val Lys Glu Arg Arg Ala
420 425 430

Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg Ile Asn Ala Leu Val
435 440 445

Ala Lys Ala Gln Lys Val Pro Gln Gly Gly Trp Ile Met Gln Asp Gly
450 455 460

Thr Pro Trp Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met Ile Gln
465 470 475 480

Val Phe Leu Gly Ser Ser Gly Gly Leu Asp Thr Glu Gly Asn Gln Leu
485 490 495

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gln His
500 505 510

His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ser Ala Val
515 520 525

Leu Thr Asn Ala Pro Phe Met Leu Asn Leu Asp Cys Asp His Tyr Val
530 535 540

Asn Asn Ser Lys Ala Ala Arg Glu Ala Met Cys Phe Leu Met Asp Pro
545 550 555 560

Gln Thr Gly Lys Lys Val Cys Tyr Val Gln Phe Pro Gln Arg Phe Asp
565 570 575

Gly Ile Asp Thr His Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe Phe
580 585 590

Asp Ile Asn Met Lys Gly Leu Asp Gly Ile Gln Gly Pro Val Tyr Val
595 600 605

Gly Thr Gly Cys Val Phe Arg Arg Gln Ala Leu Tyr Gly Tyr Asn Pro
610 615 620

Pro Lys Gly Pro Lys Arg Pro Lys Met Val Ser Cys Asp Cys Cys Pro
625 630 635 640

Cys Phe Gly Ser Arg Lys Lys Tyr Lys Glu Lys Asn Asp Ala Asn Gly
645 650 655

Glu Ala Ala Ser Leu Lys Gly Met Asp Asp Asp Lys Glu Val Leu Met
660 665 670

Ser Gln Met Asn Phe Glu Lys Lys Phe Gly Gln Ser Ser Ile Phe Val
675 680 685

Thr Ser Thr Leu Met Glu Glu Gly Gly Val Pro Pro Ser Ser Ser Pro
690 695 700

Ala Ala Leu Leu Lys Glu Ala Ile His Val Ile Ser Cys Gly Tyr Glu
705 710 715 720

Asp Lys Thr Glu Trp Gly Leu Glu Leu Gly Trp Ile Tyr Gly Ser Ile
725 730 735

Thr Glu Asp Ile Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Arg
740 745 750

Ser Ile Tyr Cys Met Pro Lys Arg Ala Ala Phe Lys Gly Thr Ala Pro
755 760 765

Ile Asn Leu Ser Asp Arg Leu Asn Gln Val Leu Arg Trp Ala Leu Gly
770 775 780

Ser Ile Glu Ile Phe Phe Ser His His Cys Pro Leu Trp Tyr Gly Phe
785 790 795 800

Lys Glu Lys Lys Leu Lys Trp Leu Glu Arg Phe Ala Tyr Ala Asn Thr
805 810 815

Thr Val Tyr Pro Phe Thr Ser Ile Pro Leu Val Ala Tyr Cys Ile Leu
820 825 830

Pro Ala Val Cys Leu Leu Thr Asp Lys Phe Ile Met Pro Pro Ile Ser
835 840 845

Thr Phe Ala Gly Leu Tyr Phe Val Ala Leu Phe Ser Ser Ile Ile Ala
850 855 860

Thr Gly Ile Leu Glu Leu Lys Trp Ser Gly Val Ser Ile Glu Glu Trp
865 870 875 880

Trp Arg Asn Glu Gln Phe Trp Val Ile Gly Gly Val Ser Ala His Leu
885 890 895

Phe Ala Val Ile Gln Gly Leu Leu Lys Val Leu Ala Gly Ile Asp Thr
900 905 910

Asn Phe Thr Val Thr Ser Lys Ala Thr Asp Asp Glu Glu Phe Gly Glu
915 920 925

Leu Tyr Thr Phe Lys Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Ile
930 935 940

Leu Ile Ile Asn Ile Val Gly Val Val Ala Gly Ile Ser Asp Ala Ile
945 950 955 960

Asn Asn Gly Tyr Gln Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe
965 970 975

Ser Phe Trp Val Ile Val His Leu Tyr Pro Phe Leu Lys Gly Leu Met
980 985 990

Gly Arg Gln Asn Arg Thr Pro Thr Ile Val Val Ile Trp Ser Val Leu
995 1000 1005

Leu Ala Ser Ile Phe Ser Leu Leu Trp Val Arg Ile Asp Pro Phe Val
1010 1015 1020

Leu Lys Thr Lys Gly Pro Asp Thr Lys Leu Cys Gly Ile Asn Cys
1025 1030 1035

<210> 15 

<211> 2125 

<212> DNA 

<213> Glycine max 



<400> 15 

gccaaagctc agaagatgcc agaggaaggt tggacaatgc aggatggaac tccttggcct 60

ggaaataatc ctagggatca tccgggaatg attcaggtgt ttttaggtca tagtgggggg 120

ctggatacag atggaaatga gctgcctaga cttgtttatg tttctcgtga gaagcgacca 180

ggcttccaac atcacaagaa ggctggagct atgaatgctt tgattcgagt ttctgctgtc 240

ttgaccaatg gtgcatatct tctgaatgtg gattgtgatc actatttcaa taatagcaaa 300

gccctcaaag aagccatgtg tttcatgatg gatcctgttc ttggaaagaa gacatgctat 360

gttcaatttc ctcagagatt tgacggcart gacttgcacg atcgatatgc caatcgcaat 420

attgtgttct ttgatatcaa catgaaaggt caggatggtg ttcagggccc agtctatgtg 480

ggaactggtt gttgtttcaa taggcaagct ttgtatggtt atgatcctgt tttgactgag 540

gaagatttgg aacctaacat tattgtaaag agttgttgcg gttctagaaa gaagggaaag 600

ggtggcaata agaagtacag tgacaagaag aaggcgatgg gaagaactga atccactgta 660

cccatattta atatggaaga catagaggag ggtgttgaag gttatgatga tgaaaggaca 720

ctacttatgt ctcaaaagag cttggagaag cgttttggtc agtctccagt ttttattgct 780

gccactttca tggagcaggg tggcattcca ccttcaacga accctgcaac tcttcttaag 840

gaagcaatcc atgttatcag ctgtggttac gaagacaaga cagaatgggg caaagagatt 900

ggatggatct atggctctgt gacagaagat atcttgactg ggttcaagat gcatgctcgt 960

ggttggattt ccatctattg catgccacct cgcccagcat ttaagggttc tgctcctatc 1020

aatctttctg atcgtctcaa tcaggtgctt cggtgggcct tgggttcaat tgagatcttt 1080

ctaagcaggc attgtcccrt gtggtatggc tacaatggga agttgaagcc tctgatgagg 1140

cttgcttata ttaacaccat tgtctacccg tttacctcaa tcccattgat tgcttactgt 1200

acgcttcctg cattttgtct tctcacaaat aaatttatta ttcctgagat aagcaacttt 1260

gccagtatgt ggttcattct tctctttgtc tccattttta ccacttcaat tcttgagctt 1320

aggtggagtg gggtcagtat agaagactgg tggagaaatg aacagttctg ggttatcggt 1380

gggacatctg cgcatctctt tgctgtgttc caggggcttc taaaagtgct tgctgggatc 1440

gatacaaatt tractgttac atcgaaggca tcggacgagg atggggactt tgccgagctt 1500

tatgtgttta aatggacatc acttctcatc cctcctacaa cagtgcttat tgtgaatttg 1560

gttgggattg tggctggtgt atcctatgcc ataaacagtg gttaccagtc ttggggtcca 1620

ctatttggca agctgttctt tgctatctgg gtcattgccc atctataccc attcttgaag 1680

ggtctcttgg gcaggcaaaa icgiacccca accattgtta ttgtttggtc cgttcttctt 1740

gcttcaatat tctccttgct gtgggtgagg attgatccct tcacctctga ctccaacaaa 1800

ttaaccaatg gtcaatgtgg catcaactgt tagttctctt gtatgattca ttttgtgttg 1860

ttattccctt trgcttggag atacacaagg ttgctgtcgt gtatatagca agaatttrca 1920

gcctatcaaa gttgtctgga ggattgaacc cctgaaatag atgggaatgt accctctctg 1980

tttctattat ttatctacat gttccttaca agaatagtca gtagtaatgt tgaggtgtat 2040

gttatatttt ttccccacag aatataaatt tgttcatgcg aatatttaat gaaagccaac 2100

aaggtcctgt gttgttttgt tcttt 2125



<210> 16 

<211> 610 

<212> PRT 

<213> Glycine max 

<400> 16 

Ala Lys Ala Gln Lys Met Pro Glu Glu Gly Trp Thr Met Gln Asp Gly
1 5 10 15

Thr Pro Trp Pro Gly Asn Asn Pro Arg Asp His Pro Gly Met Ile Gln
20 25 30

Val Phe Leu Gly His Ser Gly Gly Leu Asp Thr Asp Gly Asn Glu Leu
35 40 45

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gln His
50 55 60

His Lys Lys Ala Gly Ala Met Asn Ala Leu Ile Arg Val Ser Ala Val
65 70 75 80

Leu Thr Asn Gly Ala Tyr Leu Leu Asn Val Asp Cys Asp His Tyr Phe
85 90 95

Asn Asn Ser Lys Ala Leu Lys Glu Ala Met Cys Phe Met Met Asp Pro
100 105 110

Val Leu Gly Lys Lys Thr Cys Tyr Val Gln Phe Pro Gln Arg Phe Asp
115 120 125

Gly Ile Asp Leu His Asp Arg Tyr Ala Asn Arg Asn Ile Val Phe Phe
130 135 140

Asp Ile Asn Met Lys Gly Gln Asp Gly Val Gln Gly Pro Val Tyr Val
145 150 155 160

Gly Thr Gly Cys Cys Phe Asn Arg Gln Ala Leu Tyr Gly Tyr Asp Pro
165 170 175

Val Leu Thr Glu Glu Asp Leu Glu Pro Asn Ile Ile Val Lys Ser Cys
180 185 190

Cys Gly Ser Arg Lys Lys Gly Lys Gly Gly Asn Lys Lys Tyr Ser Asp
195 200 205

Lys Lys Lys Ala Met Gly Arg Thr Glu Ser Thr Val Pro Ile Phe Asn
210 215 220

Met Glu Asp Ile Glu Glu Gly Val Glu Gly Tyr Asp Asp Glu Arg Thr
225 230 235 240

Leu Leu Met Ser Gln Lys Ser Leu Glu Lys Arg Phe Gly Gln Ser Pro
245 250 255

Val Phe Ile Ala Ala Thr Phe Met Glu Gln Gly Gly Ile Pro Pro Ser
260 265 270

Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala Ile His Val Ile Ser Cys
275 280 285

Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys Glu Ile Gly Trp Ile Tyr
290 295 300

Gly Ser Val Thr Glu Asp Ile Leu Thr Gly Phe Lys Met His Ala Arg
305 310 315 320

Gly Trp Ile Ser Ile Tyr Cys Met Pro Pro Arg Pro Ala Phe Lys Gly
325 330 335

Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu Asn Gln Val Leu Arg Trp
340 345 350

Ala Leu Gly Ser Ile Glu Ile Phe Leu Ser Arg His Cys Pro Leu Trp
355 360 365

Tyr Gly Tyr Asn Gly Lys Leu Lys Pro Leu Met Arg Leu Ala Tyr Ile
370 375 380

Asn Thr Ile Val Tyr Pro Phe Thr Ser Ile Pro Leu Ile Ala Tyr Cys
385 390 395 400

Thr Leu Pro Ala Phe Cys Leu Leu Thr Asn Lys Phe Ile Ile Pro Glu
405 410 415

Ile Ser Asn Phe Ala Ser Met Trp Phe Ile Leu Leu Phe Val Ser Ile
420 425 430

Phe Thr Thr Ser Ile Leu Glu Leu Arg Trp Ser Gly Val Ser Ile Glu
435 440 445

Asp Trp Trp Arg Asn Glu Gln Phe Trp Val Ile Gly Gly Thr Ser Ala
450 455 460

His Leu Phe Ala Val Phe Gln Gly Leu Leu Lys Val Leu Ala Gly Ile
465 470 475 480

Asp Thr Asn Phe Thr Val Thr Ser Lys Ala Ser Asp Glu Asp Gly Asp
485 490 495

Phe Ala Glu Leu Tyr Val Phe Lys Trp Thr Ser Leu Leu Ile Pro Pro
500 505 510

Thr Thr Val Leu Ile Val Asn Leu Val Gly Ile Val Ala Gly Val Ser
515 520 525

Tyr Ala Ile Asn Ser Gly Tyr Gln Ser Trp Gly Pro Leu Phe Gly Lys
530 535 540

Leu Phe Phe Ala Ile Trp Val Ile Ala His Leu Tyr Pro Phe Leu Lys
545 550 555 560

Gly Leu Leu Gly Arg Gln Asn Arg Thr Pro Thr Ile Val Ile Val Trp
565 570 575

Ser Val Leu Leu Ala Ser Ile Phe Ser Leu Leu Trp Val Arg Ile Asp
580 585 590

Pro Phe Thr Ser Asp Ser Asn Lys Leu Thr Asn Gly Gln Cys Gly Ile
595 600 605

Asn Cys
610



<210> 17 

<211> 2890 

<212> DNA 

<213> Glycine max 



<400> 17 

gcacgagctc cacccagtta atgatgcata tggcttgtgg ttgacatcag tcatctgtga 60

aatatggttt gctgtatcat ggataatgga tcagtttcca aaatggtacc caatacagcg 120

agaaacatac cttgatcgtc tgtcactcag gtatgaaaaa gaagggaagc catcxgagtt 180

gtccagtgta gacgtctttg tcagtactgt cgatcccatg aaggaacctc cactgactac 240

agcaaacact gttctatcta tccttgctgt tgattatcca gttgataaag ttgeatgeta 300

tgtctcagat gatggtgctg ctatgcttac ttttgaagca ctgtctgaga catctgaatt 360

tgctaggaga tgggttccat tttgtaagaa atacaatatt gagccccggg caccagaatg 420

gtactttggt cagaagatgg actatctgaa aaataaagta cacccagcat ttgtcaggga 480

aaggagagca atgaagaggg atratgaaga atttaaggtg aggattaaca gtttggtggc 540

aacagcacaa aaggttcctg aggatggatg gaccargcaa gatgggacrc cttggcctgg 600

aaataatgtg agggatcatc ctggcatgat tcaggtcttc cttgggcagg atggtgttcg 660

tgatgttgaa ggaaatgagc taccccgctt ggtctacgtt tctagagaaa agaggecagg 720

gtttgatcac cacaaaaagg ctggtgcaat gaatgctctg gtaegggett cagcaattar 780

cactaatgca ccctatcttc tgaatgttga ttgtgatcac tacattaaca atagcaaggc 840

acttagagaa gctatgtgtt ttatgatgga tcctcaacta gggaaaaagg tttgctatgt 900

gcaatttcct cagcgatttg atggaattga tagacatgat agatattcaa acagaaatgt 960

tgtatttttc gatattaaca tgaaaggatt ggatgggata caaggtccaa tatatgtegg 1020

aactggargt gttttcagaa ggtacgcact ttatggatat gatgeacctg ccaagaagaa 1080

accaccgagc aaaacttgta actgttggcc aaagtggtgc tgcctatgtt gtggctctag 1140

aaagaaaaag aatgccaata gtaagaagga gaaaaagagg aaggtgaagc acagtgaagc 1200

atcaaagcag atacatgcac ttgaaaatat tgaggcgggg aatgaaggaa ccaacaatga 1260

gaagacatcc aatctgactc aaacaaagtt ggagaagagg ttrggacagt ctccagtatt 1320

tgtagcctcc acacttttgg atgatggtgg agttccacat ggcgtgagtc ctgcatcact 1380

tttaaaagaa gccatccagg tcarcagttg tggttatgaa gacaaaacag aatggggaaa 1440

agaagttggg tggatatatg gttctgtgac agaggarate trgacrggat ttaaaatgea 1500

ttgccatggt tggcggtcrg rgcattgcat tectaagegg ectgeattta aggggtctgc 1560

gcctatcaac cittcagatc gtcrgcacca agttcttcgg tgggctcttg ggcctgttga 1620

gatttttttc agcagacatt gtccaarctg gtatggctat ggtggrggat tgaaartgtt 1680

ggaacgattt tcctacatta actcggtcgt atatccctgg acttccctcc cattgettgt 1740

ctactgtact ctaccagcca tatgccttct gacrggaaaa tttategtae ccgagattag 1800

caactatgcc agtcttgtgt tcatggccct cttcatarcc attgeagcaa ctggcatcct 1860

tgagatgcaa tggggcggtg ttagcataga cgactggtgg aggaacgaac agtttrgggt 1920

gatcggaggt gtttcttccc atctatttgc cctatttcag ggtttactga aggtcttggc 1980

tggtgtgaac acaaacttca ctgtgacctc aaaagcagca gatgatggag aattcrcaga 2040

actctacata ttcaagtgga catcactctt gatccctcca atgactttac ttatcatgaa 2100

tattgtcggg gtggttgrcg ggatctcaga tgccatcaac aatggttatg actcatgggg 2160

acctctctrt ggtagattgt tctttgcatt gtgggtgatc ctccatctct accccttctt 2220

gaaggggttg cttggaaaac aagatagaat gccaaccatt atattggttt ggtcaatcct 2280

tctggcctcc atcttgactc tcatgtgggt cagaattaac ccgtttgtgt caagagaegg 2340

ccccgtgtra gaaatttgtg gattgaattg cgacgagtcg rgaataaaga aaagctgaag 2400

aagaagggrt agttartttt cagctacact gcagtcatgt tgaagaatgc agccagcaca 2460

tgcttcacaa agttgcacga attttcggat ggaagtttta tttttcgggr ctgragagar 2520

taaagagagg aaggggaggg ggctgacaca ttgttacctt gtaatagggr tttttcattt 2580

atrctttgat tatattttct gugggtttta gugttattct ctttccagtt rcatgtatta 2640

taagaaagag gcattgaatg ataaattatt ccctcttcaa aatgggggat cctcagtctc 2700

aagaaattac ttggtcatat ttttaggtat gggtcttgtt ccgtttaaaa ccarttgtaa 2760

taccgtcaaa actatggata ttcttgrtcc tcagatgtgt ttttgtgttt tattatttaa 2820

cactcaggaa cctttrggtc tgattcaatt attcaatgtt tggatggcac taaaaaaaaa 2880

aaaaaaaaaa 2890



<210> 18 

<211> 793 

<212> PRT 

<213> Glycine max 

<400> 18 

His Glu Leu His Pro Val Asn Asp Ala Tyr Gly Leu Trp Leu Thr Ser
1 5 10 15

Val Ile Cys Glu Ile Trp Phe Ala Val Ser Trp Ile Met Asp Gln Phe
20 25 30

Pro Lys Trp Tyr Pro Ile Gln Arg Glu Thr Tyr Leu Asp Arg Leu Ser
35 40 45

Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Glu Leu Ser Ser Val Asp
50 55 60

Val Phe Val Ser Thr Val Asp Pro Met Lys Glu Pro Pro Leu Ile Thr
65 70 75 80

Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp Tyr Pro Val Asp Lys
85 90 95

Val Ala Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu
100 105 110

Ala Leu Ser Glu Thr Ser Glu Phe Ala Arg Arg Trp Val Pro Phe Cys
115 120 125

Lys Lys Tyr Asn Ile Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gly Gln
130 135 140

Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro Ala Phe Val Arg Glu
145 150 155 160

Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe Lys Val Arg Ile Asn
165 170 175

Ser Leu Val Ala Thr Ala Gln Lys Val Pro Glu Asp Gly Trp Thr Met
180 185 190

Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly
195 200 205

Met Ile Gln Val Phe Leu Gly Gln Asp Gly Val Arg Asp Val Glu Gly
210 215 220

Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly
225 230 235 240

Phe Asp His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Ala
245 250 255

Ser Ala Ile Ile Thr Asn Ala Pro Tyr Leu Leu Asn Val Asp Cys Asp
260 265 270

His Tyr Ile Asn Asn Ser Lys Ala Leu Arg Glu Ala Met Cys Phe Met
275 280 285

Met Asp Pro Gln Leu Gly Lys Lys Val Cys Tyr Val Gln Phe Pro Gln
290 295 300

Arg Phe Asp Gly Ile Asp Arg His Asp Arg Tyr Ser Asn Arg Asn Val
305 310 315 320

Val Phe Phe Asp Ile Asn Met Lys Gly Leu Asp Gly Ile Gln Gly Pro
325 330 335

Ile Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Tyr Ala Leu Tyr Gly
340 345 350

Tyr Asp Ala Pro Ala Lys Lys Lys Pro Pro Ser Lys Thr Cys Asn Cys
355 360 365

Trp Pro Lys Trp Cys Cys Leu Cys Cys Gly Ser Arg Lys Lys Lys Asn
370 375 380

Ala Asn Ser Lys Lys Glu Lys Lys Arg Lys Val Lys His Ser Glu Ala
385 390 395 400

Ser Lys Gln Ile His Ala Leu Glu Asn Ile Glu Ala Gly Asn Glu Gly
405 410 415

Thr Asn Asn Glu Lys Thr Ser Asn Leu Thr Gln Thr Lys Leu Glu Lys
420 425 430

Arg Phe Gly Gln Ser Pro Val Phe Val Ala Ser Thr Leu Leu Asp Asp
435 440 445

Gly Gly Val Pro His Gly Val Ser Pro Ala Ser Leu Leu Lys Glu Ala
450 455 460

Ile Gln Val Ile Ser Cys Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys
465 470 475 480

Glu Val Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr Gly
485 490 495

Phe Lys Met His Cys His Gly Trp Arg Ser Val Tyr Cys Ile Pro Lys
500 505 510

Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu
515 520 525

His Gln Val Leu Arg Trp Ala Leu Gly Ser Val Glu Ile Phe Phe Ser
530 535 540

Arg His Cys Pro Ile Trp Tyr Gly Tyr Gly Gly Gly Leu Lys Leu Leu
545 550 555 560

Glu Arg Phe Ser Tyr Ile Asn Ser Val Val Tyr Pro Trp Thr Ser Leu
565 570 575

Pro Leu Leu Val Tyr Cys Thr Leu Pro Ala Ile Cys Leu Leu Thr Gly
580 585 590

Lys Phe Ile Val Pro Glu Ile Ser Asn Tyr Ala Ser Leu Val Phe Met
595 600 605

Ala Leu Phe Ile Ser Ile Ala Ala Thr Gly Ile Leu Glu Met Gln Trp
610 615 620

Gly Gly Val Ser Ile Asp Asp Trp Trp Arg Asn Glu Gln Phe Trp Val
625 630 635 640

Ile Gly Gly Val Ser Ser His Leu Phe Ala Leu Phe Gln Gly Leu Leu
645 650 655

Lys Val Leu Ala Gly Val Asn Thr Asn Phe Thr Val Thr Ser Lys Ala
660 665 670

Ala Asp Asp Gly Glu Phe Ser Glu Leu Tyr Ile Phe Lys Trp Thr Ser
675 680 685

Leu Leu Ile Pro Pro Met Thr Leu Leu Ile Met Asn Ile Val Gly Val
690 695 700

Val Val Gly Ile Ser Asp Ala Ile Asn Asn Gly Tyr Asp Ser Trp Gly
705 710 715 720

Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val Ile Leu His Leu
725 730 735

Tyr Pro Phe Leu Lys Gly Leu Leu Gly Lys Gln Asp Arg Met Pro Thr
740 745 750

Ile Ile Leu Val Trp Ser Ile Leu Leu Ala Ser Ile Leu Thr Leu Met
755 760 765

Trp Val Arg Ile Asn Pro Phe Val Ser Arg Asp Gly Pro Val Leu Glu
770 775 780

Ile Cys Gly Leu Asn Cys Asp Glu Ser
785 790

<210> 19

<211> 1742

<212> DNA

<213> Triticum aestivum

<220>
<221> unsure
<222> (9)

<220>
<221> unsure
<222> (271)

<400> 19

>wheatcong aagctatgtg cttcctaatg gatccaaacc taggtccgca agcctgttat 60

gtgcagttcc cacaaaggtt tgatggtatt gataggaatg atcgatatgc aaacaggaac 120

actgtctttt ttgatattaa cttgaggggc cttgacggca ttcaaggacc agtttatgtg 180

ggaactggtt gtgttttcaa cagaacggct atctatggtt atgagccccc aattaaggcg 240

aagaagccag gtttcttggc atcattatgt nggggcaaga agaaggcaag caagtcaaag 300

aaaaggagct cagataagaa aaagtcgaac aagcatgtgg acagttctgt tccagtattc 360

aatctcgaag acatagagga gggtgrtgaa ggtgctgggt ttgatgatga gaaatcagtt 420

ctcatgtctc aaatgagctt agagaagaga tttggccagt cagcagcatt tgttgcctcc 480

actctgargg aatatggtgg tgttcctcag tcgtccactc cagaatctct tttgaaagaa 540

gctatccatg tcataagttg tggctatgag gacaagtctg aatggggaac tgagattggr 600

tggatctatg gatctgtcac agaagatatt ctaactggat tcaagatgca cgcaagaggc 660

tggcgttcaa tctattgcat gcccaagcgc ccagctttca agggatctgc ccccatcaat 720

ctttcagatc gtctgaatca agtgctgcgg tgggctcttg gttctgttga aattcttttc 780

agccggcatt gccccttatg gtatggctac ggagggcgcc tcaagttcct ggagagattc 840

gcttacatca acaccaccat ttacccacta acctctctcc cgcttctagt ctattgtata 900

ttgcctgcta tctgtctgct cactggaaag ttcatcatgc cagagattag caacttggcc 960

agtatctggt tcattgcgct cttcctttca attttcgcca ctggtatcct tgagatgagg 1020

tggagtggtg ttggcattga cgagtggtgg aggaatgaac agttctgggr cattggaggt 1080

atctctgccc atctgtttgc cgtctttcag ggtcttctga aggtgcttgc aggtatcgac 1140

accaacttca ctgtcacctc aaaggctaat gatgaagaag gcgactttgc tgagctctac 1200

atgttcaagt ggacgacgct tcttatccct ccgacgacca ttttgatcat taacatggtc 1260

ggtgtcgttg ctggtacctc ctacgccatc aacagtggtt accaatcatg ggggccgctc 1320

tttgggaagc tcttctttgc cttctgggtg attgttcact tatacccatt cctcaagggt 1380

cttatgggca ggcaaaaccg cacaccgacg attgtcatcg tctgggctgt cctcctcgct 1440 

tctatcttct ccttgctgtg ggttcgtgtt gatccattca ctacccgtct cgctggccca 1500 

aatatccaaa cctgtggcat caactgctag gaaagtggga gtttgtagag acagaaaata 1560 

taacagtgat cgagcgacca cctgtggagc cagagaatat ttatgntggg gttgtgaatt 1620 

actacgtttg agaaagttgt caaaattgag aaaacacatt tgtaaataga tgtaatagac 1680 

tatctaccgt trtcatgagg ttaagctctt cttttttgga aaaaaaaaaa aaaaaaaaaa 1740 

aa 1742 

<210> 20 

<211> 506 

<212> PRT 

<213> Triticum aestivum 
<220> 

<221> UNSURE 

<222> (88) 

<400> 20 

Glu Ala Met Cys Phe Leu Met Asp Pro Asn Leu Gly Pro Gln Val Cys
1 5 10 15

Tyr Val Gln Phe Pro Gln Arg Phe Asp Gly Ile Asp Arg Asn Asp Arg
20 25 30

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly Leu
35 40 45

Asp Gly Ile Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn
50 55 60

Arg Thr Ala Ile Tyr Gly Tyr Glu Pro Pro Ile Lys Ala Lys Lys Pro
65 70 75 80

Gly Phe Leu Ala Ser Leu Cys Xaa Gly Lys Lys Lys Ala Ser Lys Ser
85 90 95

Lys Lys Arg Ser Ser Asp Lys Lys Lys Ser Asn Lys His Val Asp Ser
100 105 110

Ser Val Pro Val Phe Asn Leu Glu Asp Ile Glu Glu Gly Val Glu Gly
115 120 125

Ala Gly Phe Asp Asp Glu Lys Ser Val Leu Met Ser Gln Met Ser Leu
130 135 140

Glu Lys Arg Phe Gly Gln Ser Ala Ala Phe Val Ala Ser Thr Leu Met
145 150 155 160

Glu Tyr Gly Gly Val Pro Gln Ser Ser Thr Pro Glu Ser Leu Leu Lys
165 170 175

Glu Ala Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Ser Glu Trp
180 185 190

Gly Thr Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu
195 200 205

Thr Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser Ile Tyr Cys Met
210 215 220

Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp
225 230 235 240

Arg Leu Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Val Glu Ile Leu
245 250 255

Phe Ser Arg His Cys Pro Leu Trp Tyr Gly Tyr Gly Gly Arg Leu Lys
260 265 270

Phe Leu Glu Arg Phe Ala Tyr Ile Asn Thr Thr Ile Tyr Pro Leu Thr
275 280 285

Ser Leu Pro Leu Leu Val Tyr Cys Ile Leu Pro Ala Ile Cys Leu Leu
290 295 300

Thr Gly Lys Phe Ile Met Pro Glu Ile Ser Asn Leu Ala Ser Ile Trp
305 310 315 320

Phe Ile Ala Leu Phe Leu Ser Ile Phe Ala Thr Gly Ile Leu Glu Met
325 330 335

Arg Trp Ser Gly Val Gly Ile Asp Glu Trp Trp Arg Asn Glu Gln Phe
340 345 350

Trp Val Ile Gly Gly Ile Ser Ala His Leu Phe Ala Val Phe Gln Gly
355 360 365

Leu Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr Val Thr Ser
370 375 380

Lys Ala Asn Asp Glu Glu Gly Asp Phe Ala Glu Leu Tyr Met Phe Lys
385 390 395 400

Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Ile Leu Ile Ile Asn Met
405 410 415

Val Gly Val Val Ala Gly Thr Ser Tyr Ala Ile Asn Ser Gly Tyr Gln
420 425 430

Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val Ile
435 440 445

Val His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg Gln Asn Arg
450 455 460

Thr Pro Thr Ile Val Ile Val Trp Ala Val Leu Leu Ala Ser Ile Phe
465 470 475 480

Ser Leu Leu Trp Val Arg Val Asp Pro Phe Thr Thr Arg Leu Ala Gly
485 490 495

Pro Asn Ile Gln Thr Cys Gly Ile Asn Cys
500 505

<210> 21 
<211> 1029 
<212> DNA 

<213> Triticum aestivum

<400> 21

gcacgagccg ctcctcacca acggccagat ggttgatgac atcccgccgg agcagcacgc 60

gctcgtgccg tcctacatga gcggcggcgg cggcgggggc aagaggatcc acccgctccc 120

tttcgcagat cccaaccttc cagtgcaacc gagatccatg gacccgtcca aggatctggc 180

cgcctacgga tatggcagcg tggcctggaa ggagagaarg gagggctgga agcagaagca 240

ggagcgcctg cagcatgtca ggagcgaggg tggcggtgat tgggatggcg acgatgcaga 300

tctgccacta atggatgaag ctaggcagcc attgtccaga aaagtcccta tatcatcaag 360

ccgaattaat ccctacagga tgattatcgt tatccggttg gtggttttgg gtttcttctt 420

ccactaccga gtgatgcatc cggcgaaaga tgcatctgca ttgcggctca tatctgtaat 480

ctgtgaaatc tggtttgcga tgtcctgtat tctrgatcag ttcccaaagt ggtttccaat 540

cgagagagag acttacctgg accgtttgtc acraaggttt gacaaggaag gtcaaccctc 600

tcagcttgct ccaatcgact tctttgtcag tacggttgat cccacaaagg aacctccctt 660

ggtcacagcg aacactgtcc tttccatcct ttctgtggat tatccggttg agaaggtctc 720

ctgctatgtt tctgatgatg gtgctgcaat gcttacgttt gaagcattgt ctgaaacatc 780

tgaatttgca aagaaatggg ttcctttcag caaaaagttt aatatcgagc ctcgtgctcc 840

tgagtggtac ttccaacaga agatagacta cctgaaagac aaggttgctg cttcatttgt 900

tagggagagg agggcgatga agagagaata cgaggaattc aaggtaagga tcaatgcctt 960

ggttgcaaaa gcccaaaagg trcctgagga aggatggaca argcaagatg gaagcccctg 1020

gcctggaaa 1029



<210> 22 

<211> 340 

<212> PRT 

<213> Triticum aestivum 



<400> 22 

Pro Leu Leu Thr Asn Gly Gln Met Val Asp Asp Ile Pro Pro Glu Gln
1 5 10 15

His Ala Leu Val Pro Ser Tyr Met Ser Gly Gly Gly Gly Gly Gly Lys
20 25 30

Arg Ile His Pro Leu Pro Phe Ala Asp Pro Asn Leu Pro Val Gln Pro
35 40 45

Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr Gly Tyr Gly Ser
50 55 60

Val Ala Trp Lys Glu Arg Met Glu Gly Trp Lys Gln Lys Gln Glu Arg
65 70 75 80

Leu Gln His Val Arg Ser Glu Gly Gly Gly Asp Trp Asp Gly Asp Asp
85 90 95

Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gln Pro Leu Ser Arg Lys
100 105 110

Val Pro Ile Ser Ser Ser Arg Ile Asn Pro Tyr Arg Met Ile Ile Val
115 120 125

Ile Arg Leu Val Val Leu Gly Phe Phe Phe His Tyr Arg Val Met His
130 135 140

Pro Ala Lys Asp Ala Phe Ala Leu Trp Leu Ile Ser Val Ile Cys Glu
145 150 155 160

Ile Trp Phe Ala Met Ser Cys Ile Leu Asp Gln Phe Pro Lys Trp Phe
165 170 175

Pro Ile Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg Phe Asp
180 185 190

Lys Glu Gly Gln Pro Ser Gln Leu Ala Pro Ile Asp Phe Phe Val Ser
195 200 205

Thr Val Asp Pro Thr Lys Glu Pro Pro Leu Val Thr Ala Asn Thr Val
210 215 220

Leu Ser Ile Leu Ser Val Asp Tyr Pro Val Glu Lys Val Ser Cys Tyr
225 230 235 240

Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu Ala Leu Ser Glu
245 250 255

Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Ser Lys Lys Phe Asn
260 265 270

Ile Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gln Gln Lys Ile Asp Tyr
275 280 285

Leu Lys Asp Lys Val Ala Ala Ser Phe Val Arg Glu Arg Arg Ala Met
290 295 300

Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg Ile Asn Ala Leu Val Ala
305 310 315 320

Lys Ala Gln Lys Val Pro Glu Glu Gly Trp Thr Met Gln Asp Gly Ser
325 330 335

Pro Trp Pro Gly
340



<210> 23 

<211> 1081 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 23 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser Tyr Arg Arg Asn Glu
1 5 10 15

Leu Val Arg Ile Arg His Glu Ser Asp Gly Gly Thr Lys Pro Leu Lys
20 25 30

Asn Met Asn Gly Gln Ile Cys Gln Ile Cys Gly Asp Asp Val Gly Leu
35 40 45

Ala Glu Thr Gly Asp Val Phe Val Ala Cys Asn Glu Cys Ala Phe Pro
50 55 60

Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Thr Gln Cys
65 70 75 80

Cys Pro Gln Cys Lys Thr Arg Phe Arg Arg His Arg Gly Ser Pro Arg
85 90 95

Val Glu Gly Asp Glu Asp Glu Asp Asp Val Asp Asp Ile Glu Asn Glu
100 105 110

Phe Asn Tyr Ala Gln Gly Ala Asn Lys Ala Arg His Gln Arg His Gly
115 120 125

Glu Glu Phe Ser Ser Ser Ser Arg His Glu Ser Gln Pro Ile Pro Leu
130 135 140

Leu Thr His Gly His Thr Val Ser Gly Glu Ile Arg Thr Pro Asp Thr
145 150 155 160

Gln Ser Val Arg Thr Thr Ser Gly Pro Leu Gly Pro Ser Asp Arg Asn
165 170 175

Ala Ile Ser Ser Pro Tyr Ile Asp Pro Arg Gln Pro Val Pro Val Arg
180 185 190

Ile Val Asp Pro Ser Lys Asp Leu Asn Ser Tyr Gly Leu Gly Asn Val
195 200 205

Asp Trp Lys Glu Arg Val Glu Gly Trp Lys Leu Lys Gln Glu Lys Asn
210 215 220

Met Leu Gln Met Thr Gly Lys Tyr His Glu Gly Lys Gly Gly Glu Ile
225 230 235 240

Glu Gly Thr Gly Ser Asn Gly Glu Glu Leu Gln Met Ala Asp Asp Thr
245 250 255

Arg Leu Pro Met Ser Arg Val Val Pro Ile Pro Ser Ser Arg Leu Thr
260 265 270

Pro Tyr Arg Val Val Ile Ile Leu Arg Leu Ile Ile Leu Cys Phe Phe
275 280 285

Leu Gln Tyr Arg Thr Thr His Pro Val Lys Asn Ala Tyr Pro Leu Trp
290 295 300

Leu Thr Ser Val Ile Cys Glu Ile Trp Phe Ala Phe Ser Trp Leu Leu
305 310 315 320

Asp Gln Phe Pro Lys Trp Tyr Pro Ile Asn Arg Glu Thr Tyr Leu Asp
325 330 335

Arg Leu Ala Ile Arg Tyr Asp Arg Asp Gly Glu Pro Ser Gln Leu Val
340 345 350

Pro Val Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro
355 360 365

Leu Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ser Val Asp Tyr Pro
370 375 380

Val Asp Lys Val Ala Cys Tyr Val Ser Asp Asp Gly Ser Ala Met Leu
385 390 395 400

Thr Phe Glu Ser Leu Ser Glu Thr Ala Glu Phe Ala Lys Lys Trp Val
405 410 415

Pro Phe Cys Lys Lys Phe Asn Ile Glu Pro Arg Ala Pro Glu Phe Tyr
420 425 430

Phe Ala Gln Lys Ile Asp Tyr Leu Lys Asp Lys Ile Gln Pro Ser Phe
435 440 445

Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val
450 455 460

Arg Ile Asn Ala Leu Val Ala Lys Ala Gln Lys Ile Pro Glu Glu Gly
465 470 475 480

Trp Thr Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg Asp
485 490 495

His Pro Gly Met Ile Gln Val Phe Leu Gly His Ser Gly Gly Leu Asp
500 505 510

Thr Asp Gly Asn Glu Leu Pro Arg Leu Ile Tyr Val Ser Arg Glu Lys
515 520 525

Arg Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met Asn Ala Leu
530 535 540

Ile Arg Val Ser Ala Val Leu Thr Asn Gly Ala Tyr Leu Leu Asn Val
545 550 555 560

Asp Cys Asp His Tyr Phe Asn Asn Ser Lys Ala Ile Lys Glu Ala Met
565 570 575

Cys Phe Met Met Asp Pro Ala Ile Gly Lys Lys Cys Cys Tyr Val Gln
580 585 590

Phe Pro Gln Arg Phe Asp Gly Ile Asp Leu His Asp Arg Tyr Ala Asn
595 600 605

Arg Asn Ile Val Phe Phe Asp Ile Asn Met Lys Gly Leu Asp Gly Ile
610 615 620

Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Cys Phe Asn Arg Gln Ala
625 630 635 640

Leu Tyr Gly Tyr Asp Pro Val Leu Thr Glu Glu Asp Leu Glu Pro Asn
645 650 655

Ile Ile Val Lys Ser Cys Cys Gly Ser Arg Lys Lys Gly Lys Ser Ser
660 665 670

Lys Lys Tyr Asn Tyr Glu Lys Arg Arg Gly Ile Asn Arg Ser Asp Ser
675 680 685

Asn Ala Pro Leu Phe Asn Met Glu Asp Ile Asp Glu Gly Phe Glu Gly
690 695 700

Tyr Asp Asp Glu Arg Ser Ile Leu Met Ser Gln Arg Ser Val Glu Lys
705 710 715 720

Arg Phe Gly Gln Ser Pro Val Phe Ile Ala Ala Thr Phe Met Glu Gln
725 730 735

Gly Gly Ile Pro Pro Thr Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala
740 745 750

Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys
755 760 765

Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr Gly
770 775 780

Phe Lys Met His Ala Arg Gly Trp Ile Ser Ile Tyr Cys Asn Pro Pro
785 790 795 800

Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu
805 810 815

Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Ile Glu Ile Leu Leu Ser
820 825 830

Arg His Cys Pro Ile Trp Tyr Gly Tyr His Gly Arg Leu Arg Leu Leu
835 840 845



Glu Arg Ile Ala Tyr Ile Asn Thr Ile Val Tyr Pro Ile Thr Ser Ile
850 855 860

Pro Leu Ile Ala Tyr Cys Ile Leu Pro Ala Phe Cys Leu Ile Thr Asp
865 870 875 880

Arg Phe Ile Ile Pro Glu Ile Ser Asn Tyr Ala Ser Ile Trp Phe Ile
885 890 895

Leu Leu Phe Ile Ser Ile Ala Val Thr Gly Ile Leu Glu Leu Arg Trp
900 905 910

Ser Gly Val Ser Ile Glu Asp Trp Trp Arg Asn Glu Gln Phe Trp Val
915 920 925

Ile Gly Gly Thr Ser Ala His Leu Phe Ala Val Phe Gln Gly Leu Leu
930 935 940

Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr Val Thr Ser Lys Ala
945 950 955 960

Thr Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr Ile Phe Lys Trp Thr
965 970 975

Ala Leu Leu Ile Pro Pro Thr Thr Val Leu Leu Val Asn Leu Ile Gly
980 985 990

Ile Val Ala Gly Val Ser Tyr Ala Val Asn Ser Gly Tyr Gln Ser Trp
995 1000 1005

Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Leu Trp Val Ile Ala His
1010 1015 1020

Leu Tyr Pro Phe Leu Lys Gly Leu Leu Gly Arg Gln Asn Arg Thr Pro
1025 1030 1035 1040

Thr Ile Val Ile Val Trp Ser Val Leu Leu Ala Ser Ile Phe Ser Leu
1045 1050 1055

Leu Trp Val Arg Ile Asn Pro Phe Val Asp Ala Asn Pro Asn Ala Asn
1060 1065 1070

Asn Phe Asn Gly Lys Gly Gly Val Phe
1075 1080



<210> 24 
<211> 1084 
<212> PRT 

<213> Arabidopsis thaliana 


















<400> 24 
Met Asn Thr Gly Gly Arg Leu Ile Ala Gly Ser His Asn Arg Asn
1 5 10 15

Phe Val Leu Ile Asn Ala Asp Glu Ser Ala Arg Ile Arg Ser Val
20 25 30

Glu Leu Ser Gly Gln Thr Cys Gln Ile Cys Gly Asp Glu Ile Glu
35 40 45

Thr Val Ser Ser Glu Leu Phe Val Ala Cys Asn Glu Cys Ala Phe
50 55 60


Val Cys Arg 
65 


Pro 


Cys 


Tyr 
70 


Glu 


x y r 


r 1 n 
^_1_U 




A2TCJ" 

75 


\j± u 


u _l y 


As n 




Cys Pro Gln Cys Lys Thr Arg Tyr Lys Arg Ile Lys Gly Ser Pro
85 90 95

Val Asp Gly Asp Asp Glu Glu Glu Glu Asp Ile Asp Asp Leu Glu
100 105 110

Glu Phe Asp His Gly Met Asp Pro Glu His Ala Ala Glu Ala Ala
115 120 125



80 



Ser Ser Arg Leu Asn Thr Gly Arg Gly Gly Leu Asp Ser Ala Pro Pro
130 135 140

Gly Ser Gln Ile Pro Leu Leu Thr Tyr Cys Asp Glu Asp Ala Asp Met
145 150 155 160

Tyr Ser Asp Arg His Ala Leu Ile Val Pro Pro Ser Thr Gly Tyr Gly
165 170 175

Asn Arg Val Tyr Pro Ala Pro Phe Thr Asp Ser Ser Ala Pro Pro Gln
180 185 190

Ala Arg Ser Met Val Pro Gln Lys Asp Ile Ala Glu Tyr Gly Tyr Gly
195 200 205

Ser Val Ala Trp Lys Asp Arg Met Glu Val Trp Lys Arg Arg Gln Gly
210 215 220

Glu Lys Leu Gln Val Ile Lys His Glu Gly Gly Asn Asn Gly Arg Gly
225 230 235 240

Ser Asn Asp Asp Asp Glu Leu Asp Asp Pro Asp Met Pro Met Met Asp
245 250 255

Glu Gly Arg Gln Pro Leu Ser Arg Lys Leu Pro Ile Arg Ser Ser Arg
260 265 270

Ile Asn Pro Tyr Arg Met Leu Ile Leu Cys Arg Leu Ala Ile Leu Gly
275 280 285

Leu Phe Phe His Tyr Arg Ile Leu His Pro Val Asn Asp Ala Tyr Gly
290 295 300

Leu Trp Leu Thr Ser Val Ile Cys Glu Ile Trp Phe Ala Val Ser Trp
305 310 315 320

Ile Leu Asp Gln Phe Pro Lys Trp Tyr Pro Ile Glu Arg Glu Thr Tyr
325 330 335

Leu Asp Arg Leu Ser Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Gly
340 345 350

Leu Ala Pro Val Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys Glu
355 360 365

Pro Pro Leu Ile Thr Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp
370 375 380

Tyr Pro Val Asp Lys Val Ala Cys Tyr Val Ser Asp Asp Gly Ala Ala
385 390 395 400

Met Leu Thr Phe Glu Ala Leu Ser Asp Thr Ala Glu Phe Ala Arg Lys
405 410 415

Trp Val Pro Phe Cys Lys Lys Phe Asn Ile Glu Pro Arg Ala Pro Glu
420 425 430

Trp Tyr Phe Ser Gln Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro
435 440 445

Ala Phe Val Arg Glu Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe
450 455 460

Lys Val Lys Ile Asn Ala Leu Val Ala Thr Ala Gln Lys Val Pro Glu
465 470 475 480

Glu Gly Trp Thr Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Val
485 490 495

Arg Asp His Pro Gly Met Ile Gln Val Phe Leu Gly His Ser Gly Val
500 505 510

Arg Asp Thr Asp Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg
515 520 525

Glu Lys Arg Pro Gly Phe Asp His His Lys Lys Ala Gly Ala Met Asn
530 535 540

Ser Leu Ile Arg Val Ser Ala Val Leu Ser Asn Ala Pro Tyr Leu Leu
545 550 555 560

Asn Val Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Ile Arg Glu
565 570 575

Ser Met Cys Phe Met Met Asp Pro Gln Ser Gly Lys Lys Val Cys Tyr
580 585 590

Val Gln Phe Pro Gln Arg Phe Asp Gly Ile Asp Arg His Asp Arg Tyr
595 600 605

Ser Asn Arg Asn Val Val Phe Phe Asp Ile Asn Met Lys Gly Leu Asp
610 615 620

Gly Ile Gln Gly Pro Ile Tyr Val Gly Thr Gly Cys Val Phe Arg Arg
625 630 635 640

Gln Ala Leu Tyr Gly Phe Asp Ala Pro Lys Lys Lys Lys Pro Pro Gly
645 650 655

Lys Thr Cys Asn Cys Trp Pro Lys Trp Cys Cys Leu Cys Cys Gly Leu
660 665 670

Arg Lys Lys Ser Lys Thr Lys Ala Lys Asp Lys Lys Thr Asn Thr Lys
675 680 685

Glu Thr Ser Lys Gln Ile His Ala Leu Glu Asn Val Asp Glu Gly Val
690 695 700

Ile Val Pro Val Ser Asn Val Glu Lys Arg Ser Glu Ala Thr Gln Leu
705 710 715 720

Lys Leu Glu Lys Lys Phe Gly Gln Ser Pro Val Phe Val Ala Ser Ala
725 730 735

Val Leu Gln Asn Gly Gly Val Pro Arg Asn Ala Ser Pro Ala Cys Leu
740 745 750

Leu Arg Glu Ala Ile Gln Val Ile Ser Cys Gly Tyr Glu Asp Lys Thr
755 760 765

Glu Trp Gly Lys Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp
770 775 780

Ile Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser Val Tyr
785 790 795 800

Cys Met Pro Lys Arg Ala Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu
805 810 815

Ser Asp Arg Leu His Gln Val Leu Arg Trp Ala Leu Gly Ser Val Glu
820 825 830

Ile Phe Leu Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Gly Gly Gly
835 840 845

Leu Lys Trp Leu Glu Arg Phe Ser Tyr Ile Asn Ser Val Val Tyr Pro
850 855 860

Trp Thr Ser Leu Pro Leu Ile Val Tyr Cys Ser Leu Pro Ala Val Cys
865 870 875 880

Leu Leu Thr Gly Lys Phe Ile Val Pro Glu Ile Ser Asn Tyr Ala Gly
885 890 895

Ile Leu Phe Met Leu Met Phe Ile Ser Ile Ala Val Thr Gly Ile Leu
900 905 910

Glu Met Gln Trp Gly Gly Val Gly Ile Asp Asp Trp Trp Arg Asn Glu
915 920 925

Gln Phe Trp Val Ile Gly Gly Ala Ser Ser His Leu Phe Ala Leu Phe
930 935 940

Gln Gly Leu Leu Lys Val Leu Ala Gly Val Asn Thr Asn Phe Thr Val
945 950 955 960

Thr Ser Lys Ala Ala Asp Asp Gly Ala Phe Ser Glu Leu Tyr Ile Phe
965 970 975

Lys Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Ile Ile Asn
980 985 990

Ile Ile Gly Val Ile Val Gly Val Ser Asp Ala Ile Ser Asn Gly Tyr
995 1000 1005

Asp Ser Trp Gly Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val
1010 1015 1020

Ile Val His Leu Tyr Pro Phe Leu Lys Gly Met Leu Gly Lys Gln Asp
1025 1030 1035 1040

Lys Met Pro Thr Ile Ile Val Val Trp Ser Ile Leu Leu Ala Ser Ile
1045 1050 1055

Leu Thr Leu Leu Trp Val Arg Val Asn Pro Phe Val Ala Lys Gly Gly
1060 1065 1070

Pro Val Leu Glu Ile Cys Gly Leu Asn Cys Gly Asn
1075 1080

<210> 25 
<211> 685 
<212> PRT 

<213> Gossypium hirsutum 
<400> 25 

Arg Arg Trp Val Pro Phe Cys Lys Lys His Asn Val Glu Pro Arg Ala
1 5 10 15

Pro Glu Phe Tyr Phe Asn Glu Lys Ile Asp Tyr Leu Lys Asp Lys Val
20 25 30

His Pro Ser Phe Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu
35 40 45

Glu Phe Lys Val Arg Ile Asn Ala Leu Val Ala Lys Ala Gln Lys Lys
50 55 60

Pro Glu Glu Gly Trp Val Met Gln Asp Gly Thr Pro Trp Pro Gly Asn
65 70 75 80

Asn Thr Arg Asp His Pro Gly Met Ile Gln Val Tyr Leu Gly Ser Ala
85 90 95

Gly Ala Leu Asp Val Asp Gly Lys Glu Leu Pro Arg Leu Val Tyr Val
100 105 110






WO 00/04166 



PCT/US99/15871 



Ser Arg Glu Lys Arg Pro Gly Tyr Gln His His Lys Lys Ala Gly Ala
115 120 125

Glu Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr Asn Ala Pro Phe
130 135 140

Ile Leu Asn Leu Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Met
145 150 155 160

Arg Glu Ala Met Cys Phe Leu Met Asp Pro Gln Phe Gly Lys Lys Leu
165 170 175

Cys Tyr Val Gln Phe Pro Gln Arg Phe Asp Gly Ile Asp Arg His Asp
180 185 190

Arg Tyr Ala Asn Arg Asn Val Val Phe Phe Asp Ile Asn Met Leu Gly
195 200 205

Leu Asp Gly Leu Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe
210 215 220

Asn Arg Gln Ala Leu Tyr Gly Tyr Asp Pro Pro Val Ser Glu Lys Arg
225 230 235 240

Pro Lys Met Thr Cys Asp Cys Trp Pro Ser Trp Cys Cys Cys Cys Cys
245 250 255

Gly Gly Ser Arg Lys Lys Ser Lys Lys Lys Gly Glu Lys Lys Gly Leu
260 265 270

Leu Gly Gly Leu Leu Tyr Gly Lys Lys Lys Lys Met Met Gly Lys Asn
275 280 285

Tyr Val Lys Lys Gly Ser Ala Pro Val Phe Asp Leu Glu Glu Ile Glu
290 295 300

Glu Gly Leu Glu Gly Tyr Glu Glu Leu Glu Lys Ser Thr Leu Met Ser
305 310 315 320

Gln Lys Asn Phe Glu Lys Arg Phe Gly Gln Ser Pro Val Phe Ile Ala
325 330 335

Ser Thr Leu Met Glu Asn Gly Gly Leu Pro Glu Gly Thr Asn Ser Thr
340 345 350

Ser Leu Ile Lys Glu Ala Ile His Val Ile Ser Cys Gly Tyr Glu Glu
355 360 365

Lys Thr Glu Trp Gly Lys Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr
370 375 380

Glu Asp Ile Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Lys Ser
385 390 395 400

Val Tyr Cys Val Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile
405 410 415

Asn Leu Ser Asp Arg Leu His Gln Val Leu Arg Trp Ala Leu Gly Ser
420 425 430

Val Glu Ile Phe Leu Ser Arg His Cys Pro Leu Trp Tyr Gly Tyr Gly
435 440 445

Gly Lys Leu Lys Trp Leu Glu Arg Leu Ala Tyr Ile Asn Thr Ile Val
450 455 460

Tyr Pro Phe Thr Ser Ile Pro Leu Leu Ala Tyr Cys Thr Ile Pro Ala
465 470 475 480

Val Cys Leu Leu Thr Gly Lys Phe Ile Ile Pro Thr Leu Ser Asn Leu
485 490 495

Thr Ser Val Trp Phe Leu Ala Leu Phe Leu Ser Ile Ile Ala Thr Gly
500 505 510

Val Leu Glu Leu Arg Trp Ser Gly Val Ser Ile Gln Asp Trp Trp Arg
515 520 525

Asn Glu Gln Phe Trp Val Ile Gly Gly Val Ser Ala His Leu Phe Ala
530 535 540

Val Phe Gln Gly Leu Leu Lys Val Leu Ala Gly Val Asp Thr Asn Phe
545 550 555 560

Thr Val Thr Ala Lys Ala Ala Asp Asp Thr Glu Phe Gly Glu Leu Tyr
565 570 575

Leu Phe Lys Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Ile Ile
580 585 590

Leu Asn Met Val Gly Val Val Ala Gly Val Ser Asp Ala Ile Asn Asn
595 600 605

Gly Tyr Gly Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe
610 615 620



Trp Val Ile Leu His Leu Tyr Pro Lys Gly Leu
625 630 640

Gln Asn Arg Thr Pro Thr Ile Val Val Leu Trp Ser Ile
645 650 655

Ser Ile Phe Ser Leu Val Trp Val Arg Asp Pro Phe
660 665 670

Gln Thr Gly Pro Leu Lys Gln Cys Gly
680

<400> 26

Met Ala Ser Thr Pro Gln Thr Ser Lys Val
1 5 10 15

Gly Ser Gly Gln Thr Ala
20 25 30

Tyr Val Ser Leu Ser Arg Asp Asn Ile Glu Leu Ser Gly Glu Leu Ser
35 40 45

Gly Asp Tyr Ser Asn Tyr Thr Val His Ile Pro Pro Thr Pro Asp Asn
50 55 60

Gln Pro Met Ala Thr Lys Ala Glu Glu Gln Tyr Val Ser Asn Ser Leu
65 70 75 80

Phe Thr Gly Gly Phe Asn Ser Val Thr Arg Ala His Leu Met Asp Lys
85 90 95

Val Ile Asp Ser Asp Val Thr His Pro Gln Met Ala Gly Ala Lys Gly
100 105 110

Ser Ser Cys Ala Met Pro Ala Cys Asp Gly Asn Val Met Lys Asp Glu
115 120 125

Arg Gly Lys Asp Val Met Pro Cys Glu Cys Arg Phe Lys Ile Cys Arg
130 135 140

Asp Cys Phe Met Asp Ala Gln Lys Glu Thr Gly Leu Cys Pro Gly Cys
145 150 155 160

Lys Glu Gln Tyr Lys Ile Gly Asp Leu Asp Asp Asp Thr Pro Asp Tyr
165 170 175

Ser Ser Gly Ala Leu Pro Leu Pro Ala Pro Gly Lys Asp Gln Arg Gly
180 185 190

Asn Asn Asn Asn Met Ser Met Met Lys Arg Asn Gln Asn Gly Glu Phe
195 200 205

Asp His Asn Arg Trp Leu Phe Glu Thr Gln Gly Thr Tyr Gly Tyr Gly
210 215 220

Asn Ala Tyr Trp Pro Gln Asp Glu Met Tyr Gly Asp Asp Met Asp Glu
225 230 235 240

Gly Met Arg Gly Gly Met Val Glu Thr Ala Asp Lys Pro Trp Arg Pro
245 250 255

Leu Ser Arg Arg Ile Pro Ile Pro Ala Ala Ile Ile Ser Pro Tyr Arg
260 265 270

Leu Leu Ile Val Ile Arg Phe Val Val Leu Cys Phe Phe Leu Thr Trp
275 280 285

Arg Ile Arg Asn Pro Asn Glu Asp Ala Ile Trp Leu Trp Leu Met Ser
290 295 300

Ile Ile Cys Glu Leu Trp Phe Gly Phe Ser Trp Ile Leu Asp Gln Ile
305 310 315 320

Pro Lys Leu Cys Pro Ile Asn Arg Ser Thr Asp Leu Glu Val Leu Arg
325 330 335

Asp Lys Phe Asp Met Pro Ser Pro Ser Asn Pro Thr Gly Arg Ser Asp
340 345 350

Leu Pro Gly Ile Asp Leu Phe Val Ser Thr Ala Asp Pro Glu Lys Glu
355 360 365

Pro Pro Leu Val Thr Ala Asn Thr Ile Leu Ser Ile Leu Ala Val Asp
370 375 380

Tyr Pro Val Glu Lys Val Ser Cys Tyr Leu Ser Asp Asp Gly Gly Ala
385 390 395 400

Leu Leu Ser Phe Glu Ala Met Ala Glu Ala Ala Ser Phe Ala Asp Leu
405 410 415

Trp Val Pro Phe Cys Arg Lys His Asn Ile Glu Pro Arg Asn Pro Asp
420 425 430

Ser Tyr Phe Ser Leu Lys Ile Asp Pro Thr Lys Asn Lys Ser Arg Ile
435 440 445

Asp Phe Val Lys Asp Arg Arg Lys Ile Lys Arg Glu Tyr Asp Glu Phe
450 455 460

Lys Val Arg Ile Asn Gly Leu Pro Asp Ser Ile Arg Arg Arg Ser Asp
465 470 475 480

Ala Phe Asn Ala Arg Glu Glu Met Lys Ala Leu Lys Gln Met Arg Glu
485 490 495

Ser Gly Gly Asp Pro Thr Glu Pro Val Lys Val Pro Lys Ala Thr Trp
500 505 510

Met Ala Asp Gly Thr His Trp Pro Gly Thr Trp Ala Ala Ser Thr Arg
515 520 525

Glu His Ser Lys Gly Asp His Ala Gly Ile Leu Gln Val Met Leu Lys
530 535 540

Pro Pro Ser Ser Asp Pro Leu Ile Gly Asn Ser Asp Asp Lys Val Ile
545 550 555 560

Asp Phe Ser Asp Thr Asp Thr Arg Leu Pro Met Phe Val Tyr Val Ser
565 570 575

Arg Glu Lys Arg Pro Gly Tyr Asp His Asn Lys Lys Ala Gly Ala Met
580 585 590

Asn Ala Leu Val Arg Ala Ser Ala Ile Leu Ser Asn Gly Pro Phe Ile
595 600 605

Leu Asn Leu Asp Cys Asp His Tyr Ile Tyr Asn Cys Lys Ala Val Arg
610 615 620

Glu Gly Met Cys Phe Met Met Asp Arg Gly Gly Glu Asp Ile Cys Tyr
625 630 635 640

Ile Gln Phe Pro Gln Arg Phe Glu Gly Ile Asp Pro Ser Asp Arg Tyr
645 650 655

Ala Asn Asn Asn Thr Val Phe Phe Asp Gly Asn Met Arg Ala Leu Asp
660 665 670

Gly Val Gln Gly Pro Val Tyr Val Gly Thr Gly Thr Met Phe Arg Arg
675 680 685

Phe Ala Leu Tyr Gly Phe Asp Pro Pro Asn Pro Asp Lys Leu Leu Glu
690 695 700

Lys Lys Glu Ser Glu Thr Glu Ala Leu Thr Thr Ser Asp Phe Asp Pro
705 710 715 720

Asp Leu Asp Val Thr Gln Leu Pro Lys Arg Phe Gly Asn Ser Thr Leu
725 730 735

Leu Ala Glu Ser Ile Pro Ile Ala Glu Phe Gln Gly Arg Pro Leu Ala
740 745 750

Asp His Pro Ala Val Lys Tyr Gly Arg Pro Pro Gly Ala Leu Arg Val
755 760 765

Pro Arg Asp Pro Leu Asp Ala Thr Thr Val Ala Glu Ser Val Ser Val
770 775 780

Ile Ser Cys Trp Tyr Glu Asp Lys Thr Glu Trp Gly Asp Arg Val Gly
785 790 795 800

Trp Ile Tyr Gly Ser Val Thr Glu Asp Val Val Thr Gly Tyr Arg Met
805 810 815

His Asn Arg Gly Trp Arg Ser Val Tyr Cys Ile Thr Lys Arg Asp Ser
820 825 830

Phe Arg Gly Ser Ala Pro Ile Asn Leu Thr Asp Arg Leu His Gln Val
835 840 845

Leu Arg Trp Ala Thr Gly Ser Val Glu Ile Phe Phe Ser Arg Asn Asn
850 855 860

Ala Ile Leu Ala Ser Lys Arg Leu Lys Phe Leu Gln Arg Leu Ala Tyr
865 870 875 880

Leu Asn Val Gly Ile Tyr Pro Phe Thr Ser Leu Phe Leu Ile Leu Tyr
885 890 895

Cys Phe Leu Pro Ala Phe Ser Leu Phe Ser Gly Gln Phe Ile Val Arg
900 905 910

Thr Leu Ser Ile Ser Phe Leu Val Tyr Leu Leu Met Ile Thr Ile Cys
915 920 925

Leu Ile Gly Leu Ala Val Leu Glu Val Lys Trp Ser Gly Ile Gly Leu
930 935 940

Glu Glu Trp Trp Arg Asn Glu Gln Trp Trp Leu Ile Ser Gly Thr Ser
945 950 955 960

Ser His Leu Tyr Ala Val Val Gln Gly Val Leu Lys Val Ile Ala Gly
965 970 975

Ile Glu Ile Ser Phe Thr Leu Thr Thr Lys Ser Gly Gly Asp Asp Asn
980 985 990



Glu Asp Ile Tyr Ala Asp Leu Tyr Ile Val Lys Trp Ser Ser Leu Met
995 1000 1005

Ile Pro Pro Ile Val Ile Ala Met Val Asn Ile Ile Ala Ile Val Val
1010 1015 1020

Ala Phe Ile Arg Thr Ile Tyr Gln Ala Val Pro Gln Trp Ser Lys Leu
1025 1030 1035 1040

Ile Gly Gly Ala Phe Phe Ser Phe Trp Val Leu Ala His Leu Tyr Pro
1045 1050 1055

Phe Ala Lys Gly Leu Met Gly Arg Arg Gly Lys Thr Pro Thr Ile Val
1060 1065 1070

Phe Val Trp Ala Gly Leu Ile Ala Ile Thr Ile Ser Leu Leu Trp Thr
1075 1080 1085

Ala Ile Asn Pro Asn Thr Gly Pro Ala Ala Ala Ala Glu Gly Val Gly
1090 1095 1100

Gly Gly Gly Phe Gln Phe Pro
1105 1110

<210> 27 

<211> 1026 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 27 

Met Glu Ala Ser Ala 
1 5 

Leu Val Val He His 
20 

Asp Gly Gin Phe Cys 
35 

Glu Gly Asp Leu Phe 
50 

Arg Pro Cys Tyr Glu 

65 

Gin Cys Lys Thr Arg 
85 

Gly Asp Glu Asp Glu 
100 

He Glu His Glu Gin 
115 

Tyr Gly Lys Met Ser 
130 

Arg Phe Pro Pro Val 
145 



Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu 
10 15 

Asn His Glu Glu Pro Lys Pro Leu Lys Asn Leu 
25 30 

Glu He Cys Gly Asp Gin lie Gly Leu Thr Val 
40 45 

Val Ala Cys Asn Glu Cys Gly Phe Pro Ala Cys 
55 60 

Tyr Glu Arg Arg Glu Gly Thr Gin Asn Cys Pro 
70 75 80 

Tyr Lys Arg Leu Arg Gly Ser Pro Arg Val Glu 
90 95 

Glu Asp lie Asp Asp He Glu Tyr Glu Phe Asn 
105 110 

Asp Lys His Lys His Ser Ala Glu Ala Met Leu 
120 125 

Tyr Gly Arg Gly Pro Glu Asp Asp Glu Asn Gly 
135 140 

lie Ala Gly Gly His Ser Gly Glu Phe Pro Val 
150 155 160 
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Gly Gly Gly Tyr Gly Asn Gly Glu His Gly Leu His Lys Arg Val His 
165 170 175 

Pro Tyr Pro Ser Ser Glu Ala Gly Ser Glu Gly Gly Trp Arg Glu Arg
180 185 190

Met Asp Asp Trp Lys Leu Gln His Gly Asn Leu Gly Pro Glu Pro Asp
195 200 205

Asp Asp Pro Glu Met Gly Leu Ile Asp Glu Ala Arg Gln Pro Leu Ser
210 215 220

Arg Lys Val Pro Ile Ala Ser Ser Lys Ile Asn Pro Tyr Arg Met Val
225 230 235 240

Ile Val Ala Arg Leu Val Ile Leu Ala Val Phe Leu Arg Tyr Arg Leu
245 250 255

Leu Asn Pro Val His Asp Ala Leu Gly Leu Trp Leu Thr Ser Val Ile
260 265 270

Cys Glu Ile Trp Phe Ala Val Ser Trp Ile Leu Asp Gln Phe Pro Lys
275 280 285

Trp Phe Pro Ile Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg
290 295 300

Tyr Glu Arg Glu Gly Glu Pro Asn Met Leu Ala Pro Val Asp Val Phe
305 310 315 320

Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val Thr Ser Asn
325 330 335

Thr Val Leu Ser Ile Leu Ala Met Asp Tyr Pro Val Glu Lys Ile Ser
340 345 350

Cys Tyr Val Ser Asp Asp Gly Ala Ser Met Leu Thr Phe Glu Ser Leu
355 360 365

Ser Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe Cys Lys Lys
370 375 380

Phe Ser Ile Glu Pro Arg Ala Pro Glu Met Tyr Phe Thr Leu Lys Val
385 390 395 400

Asp Tyr Leu Gln Asp Lys Val His Pro Thr Phe Val Lys Glu Arg Arg
405 410 415

Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg Ile Asn Ala Gln
420 425 430

Val Ala Lys Ala Ser Lys Val Pro Leu Glu Gly Trp Ile Met Gln Asp
435 440 445

Gly Thr Pro Trp Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met Ile
450 455 460

Gln Val Phe Leu Gly His Ser Gly Gly Phe Asp Val Glu Gly His Glu
465 470 475 480

Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gln
485 490 495

His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ala Gly
500 505 510

Val Leu Thr Asn Ala Pro Phe Met Leu Asn Leu Asp Cys Asp His Tyr
515 520 525

Val Asn Asn Ser Lys Ala Val Arg Glu Ala Met Cys Phe Leu Met Asp
530 535 540

Pro Gln Ile Gly Lys Lys Val Cys Tyr Val Gln Phe Pro Gln Arg Phe
545 550 555 560

Asp Gly Ile Asp Thr Asn Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe
565 570 575

Phe Asp Ile Asn Met Lys Gly Leu Asp Gly Ile Gln Gly Pro Val Tyr
580 585 590

Val Gly Thr Gly Cys Val Phe Lys Arg Gln Ala Leu Tyr Gly Tyr Glu
595 600 605

Pro Pro Lys Gly Pro Lys Arg Pro Lys Met Ile Ser Cys Gly Cys Cys
610 615 620

Pro Cys Phe Gly Arg Arg Arg Lys Asn Lys Lys Phe Ser Lys Asn Asp
625 630 635 640

Met Asn Gly Asp Val Ala Ala Leu Gly Gly Ala Glu Gly Asp Lys Glu
645 650 655

His Leu Met Phe Glu Met Asn Phe Glu Lys Thr Phe Gly Gln Ser Ser
660 665 670

Ile Phe Val Thr Ser Thr Leu Met Glu Glu Gly Gly Val Pro Pro Ser
675 680 685

Ser Ser Pro Ala Val Leu Leu Lys Glu Ala Ile His Val Ile Ser Cys
690 695 700

Gly Tyr Glu Asp Lys Thr Glu Trp Gly Thr Glu Leu Gly Trp Ile Tyr
705 710 715 720

Gly Ser Ile Thr Glu Asp Ile Leu Thr Gly Phe Lys Met His Cys Arg
725 730 735

Gly Trp Arg Ser Ile Tyr Cys Met Pro Lys Arg Pro Ala Phe Lys Gly
740 745 750

Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu Asn Gln Val Leu Arg Trp
755 760 765

Ala Leu Gly Ser Val Glu Ile Phe Phe Ser Arg His Ser Pro Leu Trp
770 775 780

Tyr Gly Tyr Lys Gly Gly Lys Leu Lys Trp Leu Glu Arg Phe Ala Tyr
785 790 795 800

Ala Asn Thr Thr Ile Tyr Pro Phe Thr Ser Ile Pro Leu Leu Ala Tyr
805 810 815 

Cys Ile Leu Pro Ala Ile Cys Leu Leu Thr Asp Lys Phe Ile Met Pro
820 825 830

Pro Ile Ser Thr Phe Ala Ser Leu Phe Phe Ile Ser Leu Phe Met Ser
835 840 845

Ile Ile Val Thr Gly Ile Leu Glu Leu Arg Trp Ser Gly Val Ser Ile
850 855 860

Glu Glu Trp Trp Arg Asn Glu Gln Phe Trp Val Ile Gly Gly Ile Ser
865 870 875 880

Ala His Leu Phe Ala Val Val Gln Gly Leu Leu Lys Ile Leu Ala Gly
885 890 895

Ile Asp Thr Asn Phe Thr Val Thr Ser Lys Ala Thr Asp Asp Asp Asp
900 905 910

Phe Gly Glu Leu Tyr Ala Phe Lys Trp Thr Thr Leu Leu Ile Pro Pro
915 920 925

Thr Thr Val Leu Ile Ile Asn Ile Val Gly Val Val Ala Gly Ile Ser
930 935 940

Asp Ala Ile Asn Asn Gly Tyr Gln Ser Trp Gly Pro Leu Phe Gly Lys
945 950 955 960

Leu Phe Phe Ser Phe Trp Val Ile Val His Leu Tyr Pro Phe Leu Lys
965 970 975

Gly Leu Met Gly Arg Gln Asn Arg Thr Pro Thr Ile Val Val Ile Trp
980 985 990

Ser Val Leu Leu Ala Ser Ile Phe Ser Leu Leu Trp Val Arg Ile Asp
995 1000 1005

Pro Phe Val Leu Lys Thr Lys Gly Pro Asp Thr Ser Lys Cys Gly Ile
1010 1015 1020 

Asn Cys 
1025 

<210> 28 
<211> 701 
<212> PRT 

<213> Gossypium hirsutum 
<400> 28 

Asp Tyr Pro Val Glu Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala 
1 5 10 15

Ala Met Leu Thr Phe Glu Ala Leu Ser Glu Thr Ser Glu Phe Ala Arg 
20 25 30 

Lys Trp Val Pro Phe Cys Lys Lys Tyr Asn Ile Glu Pro Arg Ala Pro
35 40 45

Glu Trp Tyr Phe Ala Gln Lys Ile Asp Tyr Leu Lys Asp Lys Val Gln
50 55 60

Thr Ser Phe Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu
65 70 75 80

Phe Lys Val Arg Val Asn Gly Leu Val Ala Lys Ala Gln Lys Val Pro
85 90 95

Glu Glu Gly Trp Ile Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn
100 105 110

Thr Arg Asp His Pro Gly Met Ile Gln Val Phe Leu Gly Gln Ser Gly
115 120 125

Gly Leu Asp Ala Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser
130 135 140

Arg Glu Lys Arg Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met
145 150 155 160

Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr Asn Gly Ala Phe Leu
165 170 175

Leu Asn Leu Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu Arg
180 185 190

Glu Ala Met Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gln Val Cys
195 200 205

Tyr Val Gln Phe Pro Gln Arg Phe Asp Gly Ile Asp Arg Asn Asp Arg
210 215 220

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly Leu
225 230 235 240

Asp Gly Ile Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn
245 250 255

Arg Thr Ala Leu Tyr Gly Tyr Glu Pro Pro Leu Lys Pro Lys His Arg
260 265 270

Lys Thr Gly Ile Leu Ser Ser Leu Cys Gly Gly Ser Arg Lys Lys Ser
275 280 285

Ser Lys Ser Ser Lys Lys Gly Ser Asp Lys Lys Lys Ser Gly Lys His
290 295 300

Val Asp Ser Thr Val Pro Val Phe Asn Leu Glu Asp Ile Glu Glu Gly
305 310 315 320

Val Glu Gly Ala Gly Phe Asp Asp Glu Lys Ser Leu Leu Met Ser Gln
325 330 335

Met Ser Leu Glu Lys Arg Phe Gly Gln Ser Ala Val Phe Val Ala Ser
340 345 350

Thr Leu Met Glu Asn Gly Gly Val Pro Gln Ser Ala Thr Pro Glu Thr
355 360 365

Leu Leu Lys Glu Ala Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys
370 375 380

Thr Asp Trp Gly Ser Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu
385 390 395 400

Asp Ile Leu Thr Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser Ile
405 410 415

Tyr Cys Met Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile Asn
420 425 430

Leu Ser Asp Arg Leu Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Val
435 440 445

Glu Ile Leu Phe Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Ser Gly
450 455 460

Arg Leu Lys Trp Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr Ile Tyr
465 470 475 480

Pro Val Thr Ala Ile Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val
485 490 495

Cys Leu Leu Thr Asn Lys Phe Ile Ile Pro Gln Ile Ser Asn Leu Ala
500 505 510

Ser Ile Trp Phe Ile Ser Leu Phe Leu Ser Ile Phe Ala Thr Gly Ile
515 520 525

Leu Lys Met Lys Trp Asn Gly Val Gly Ile Asp Gln Trp Trp Arg Asn
530 535 540

Glu Gln Phe Trp Val Ile Gly Gly Val Ser Ala His Leu Phe Ala Val
545 550 555 560

Phe Gln Gly Leu Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr
565 570 575

Val Thr Ser Lys Ala Ser Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr
580 585 590

Met Phe Lys Trp Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Ile
595 600 605

Ile Asn Leu Val Gly Val Val Ala Gly Ile Ser Tyr Val Ile Asn Ser
610 615 620

Gly Tyr Gln Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe
625 630 635 640

Trp Val Ile Ile His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg
645 650 655

Gln Asn Arg Thr Pro Thr Ile Val Val Val Trp Ser Ile Leu Leu Ala
660 665 670

Ser Ile Phe Ser Leu Leu Trp Val Arg Ile Asp Pro Phe Thr Thr Arg
675 680 685

Val Thr Gly Pro Asp Val Glu Gln Cys Gly Ile Asn Cys
690 695 700

<210> 29 
<211> 1081 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 29 

Arg Pro Arg Leu Ile Ala Gly Ser His Asn Arg Asn Glu Phe Val Leu
1 5 10 15

Ile Asn Ala Asp Glu Asn Ala Arg Ile Arg Ser Val Gln Glu Leu Ser
20 25 30

Gly Gln Thr Cys Gln Ile Cys Arg Asp Glu Ile Glu Leu Thr Val Asp
35 40 45

Gly Glu Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro Val Cys Arg
50 55 60

Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Asn Gln Ala Cys Pro Gln
65 70 75 80

Cys Lys Thr Arg Phe Lys Arg Leu Lys Gly Ser Pro Arg Val Glu Gly
85 90 95

Asp Glu Glu Glu Asp Asp Ile Asp Asp Leu Asp Asn Glu Phe Glu Tyr
100 105 110

Gly Asn Asn Gly Ile Gly Phe Asp Gln Val Ser Glu Gly Met Ser Ile
115 120 125

Ser Arg Arg Asn Ser Gly Phe Pro Gln Ser Asp Leu Asp Ser Ala Pro
130 135 140

Pro Gly Ser Gln Ile Pro Leu Leu Thr Tyr Gly Asp Glu Asp Val Glu
145 150 155 160

Ile Ser Ser Asp Arg His Ala Leu Ile Val Pro Pro Ser Leu Gly Gly
165 170 175

His Gly Asn Arg Val His Pro Val Ser Leu Ser Asp Pro Thr Val Ala
180 185 190

Ala His Arg Arg Leu Met Val Pro Gln Lys Asp Leu Ala Val Tyr Gly
195 200 205

Tyr Gly Ser Val Ala Trp Lys Asp Arg Met Glu Glu Trp Lys Arg Lys
210 215 220

Gln Asn Glu Lys Leu Gln Val Val Arg His Glu Gly Asp Pro Asp Phe
225 230 235 240

Glu Asp Gly Asp Asp Ala Asp Phe Pro Met Met Asp Glu Gly Arg Gln
245 250 255

Pro Leu Ser Met Lys Ile Pro Ile Lys Ser Ser Lys Ile Asn Pro Tyr
260 265 270

Arg Met Leu Ile Val Leu Arg Leu Val Ile Leu Gly Leu Phe Phe His
275 280 285

Tyr Arg Ile Leu His Pro Val Lys Asp Ala Tyr Ala Leu Trp Leu Ile
290 295 300

Ser Val Ile Cys Glu Ile Trp Phe Ala Val Ser Trp Val Leu Asp Gln
305 310 315 320

Phe Pro Lys Trp Tyr Pro Ile Glu Arg Glu Thr Tyr Leu Asp Arg Leu
325 330 335

Ser Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Gly Leu Ser Pro Val
340 345 350

Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Ile
355 360 365

Thr Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp Tyr Pro Val Asp
370 375 380

Lys Val Ala Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe
385 390 395 400

Glu Ala Leu Ser Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe
405 410 415

Cys Lys Lys Tyr Cys Ile Glu Pro Arg Ala Pro Glu Trp Tyr Phe Cys
420 425 430

His Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro Ala Phe Val Arg
435 440 445

Glu Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe Lys Val Lys Ile
450 455 460

Asn Ala Leu Val Ala Thr Ala Gln Lys Val Pro Glu Asp Gly Trp Thr
465 470 475 480

Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Ser Val Arg Asp His Pro
485 490 495

Gly Met Ile Gln Val Phe Leu Gly Ser Asp Gly Val Arg Asp Val Glu
500 505 510

Asn Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro
515 520 525

Gly Phe Asp His His Lys Lys Ala Gly Ala Met Asn Ser Leu Ile Arg
530 535 540

Val Ser Gly Val Leu Ser Asn Ala Pro Tyr Leu Leu Asn Val Asp Cys
545 550 555 560

Asp His Tyr Ile Asn Asn Ser Lys Ala Leu Arg Glu Ala Met Cys Phe
565 570 575

Met Met Asp Pro Gln Ser Gly Lys Lys Ile Cys Tyr Val Gln Phe Pro
580 585 590

Gln Arg Phe Asp Gly Ile Asp Arg His Asp Arg Tyr Ser Asn Arg Asn
595 600 605

Val Val Phe Phe Asp Ile Asn Met Lys Gly Leu Asp Gly Leu Gln Gly
610 615 620

Pro Ile Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gln Ala Leu Tyr
625 630 635 640

Gly Phe Asp Ala Pro Lys Lys Lys Lys Gly Pro Arg Lys Thr Cys Asn
645 650 655

Cys Trp Pro Lys Trp Cys Leu Leu Cys Phe Gly Ser Arg Lys Asn Arg
660 665 670

Lys Ala Lys Thr Val Ala Ala Asp Lys Lys Lys Lys Asn Arg Glu Ala
675 680 685

Ser Lys Gln Ile His Ala Leu Glu Asn Ile Glu Glu Gly Arg Gly His
690 695 700

Lys Val Leu Asn Val Glu Gln Ser Thr Glu Ala Met Gln Met Lys Leu
705 710 715 720

Gln Lys Lys Tyr Gly Gln Ser Pro Val Phe Val Ala Ser Ala Arg Leu
725 730 735

Glu Asn Gly Gly Met Ala Arg Asn Ala Ser Pro Ala Cys Leu Leu Lys
740 745 750

Glu Ala Ile Gln Val Ile Ser Arg Gly Tyr Glu Asp Lys Thr Glu Trp
755 760 765

Gly Lys Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu
770 775 780

Thr Gly Ser Lys Met His Ser His Gly Trp Arg His Val Tyr Cys Thr
785 790 795 800

Pro Lys Leu Ala Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp
805 810 815

Arg Leu His Gln Val Leu Arg Trp Ala Leu Gly Ser Val Glu Ile Phe
820 825 830

Leu Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Gly Gly Gly Leu Lys
835 840 845

Trp Leu Glu Arg Leu Ser Tyr Ile Asn Ser Val Val Tyr Pro Trp Thr
850 855 860

Ser Leu Pro Leu Ile Val Tyr Cys Ser Leu Pro Ala Ile Cys Leu Leu
865 870 875 880

Thr Gly Lys Phe Ile Val Pro Glu Ile Ser Asn Tyr Ala Ser Ile Leu
885 890 895

Phe Met Ala Leu Phe Ser Ser Ile Ala Ile Thr Gly Ile Leu Glu Met
900 905 910

Gln Trp Gly Lys Val Gly Ile Asp Asp Trp Trp Arg Asn Glu Gln Phe
915 920 925 

Trp Val Ile Gly Gly Val Ser Ala His Leu Phe Ala Leu Phe Gln Gly
930 935 940

Leu Leu Lys Val Leu Ala Gly Val Asp Thr Asn Phe Thr Val Thr Ser
945 950 955 960

Lys Ala Ala Asp Asp Gly Glu Phe Ser Asp Leu Tyr Leu Phe Lys Trp
965 970 975

Thr Ser Leu Leu Ile Pro Pro Met Thr Leu Leu Ile Ile Asn Val Ile
980 985 990

Gly Val Ile Val Gly Val Ser Asp Ala Ile Ser Asn Gly Tyr Asp Ser
995 1000 1005

Trp Gly Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val Ile Ile
1010 1015 1020

His Leu Tyr Pro Phe Leu Lys Gly Leu Leu Gly Lys Gln Asp Arg Met
1025 1030 1035 1040

Pro Thr Ile Ile Val Val Trp Ser Ile Leu Leu Ala Ser Ile Leu Thr
1045 1050 1055

Leu Leu Trp Val Arg Val Asn Pro Phe Val Ala Lys Gly Gly Pro Ile
1060 1065 1070

Leu Glu Ile Cys Gly Leu Asp Cys Leu
1075 1080